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collection fiber (60) and detected to form fluores- 
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DESCRIPTION 

COMBINED FLUORESCENCE AND REFLECTANCE SPECTROSCOPY 



BACKGROUND OF THlE INVENTION 

5 1. Field of the Invention 

The present invention relates generally to the fields of optical imaging. More 
particularly, it concerns apparatus and methods for combining fluorescence and 
reflectance specu-oscopy for the imaging of samples, including both in situ and ex situ 
imagining of body tissues. 

10 

2. Description of Related Art 

Cancer is one of the leading causes of death in the United States and in the 
world, in the United States alone, deaths frtiVn cancer are estimated to number 
560.000 in 1997 (American Cancer Society Online, Cancer Facts & Figures). 

15 Currently, diagnosis and treatment of cancer"foilow 1^ evaluation of 

directed biopsies. However, the tissue removal "rieicessitated by these techniques not 
only may alter the progression of the disease (Robbins and Kumar, 1984) but is also 
very costly. Improving the capability for in sitii monitoring of disease progression 
could greatly enhance the ability to detect and treat cancer and precancer (Kelloff et 

20 a/., 1992). 

A growing number of clinical studies have demonstrated that fluorescence 
specu-oscopy may be used to distinguish normal and abnormal human tissues in vivo 
in the sicin. head and neck, genito-urinary tract, gastro-intestinal tract, breast, and 
brain. It is well known that fluorescence intensity and lineshape are a function of both 

25 the excitation and emission wavelength in samples containing multiple chromophores, 
such as human tissue. A complete characterization of the fluorescence properties of 
an unknown sample requires measurement of a fluorescence excitation emission 
matrix, in which the fluorescence intensity is recorded as a function of both excitation 
and emission wavelength. The field of analytical chemisuy has exploited the 

30 fluorescence properties of different compounds: to identify and quantify them in 
mixtures. 
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Most clinical studies reported to date have measured fluorescence emission 
spectra at only a small number of excitation wavelengths (typically one to three) due 
to clinical requirements imposed on the size/speed and sensitivity of instrumentation. 
The choice of excitation wavelength has been based on factors which vary from study 
to study, but include laser availability and predictions of chromophores thought to be . 
present in normal and abnormal tissues and measurements of fluorescence excitation 
emission matrices (EEM) of normal and abnormal tissues i/i vitro. While in vitro 
measurements of tissue EEMs are feasible using commercially available scanning 
fluorimeters. several studies have demonstrated, that the optical properties of tissue 
change significandy when tissue is examined in vitro due in part to interruption of the 
blood supply, oxidation and small size of biopsies. Thus, in vitro studies to select 
excitation wavelengths are of limited value. 

Several recent studies have suggested that differences in optical properties, 
assessed using diffuse reflectance spectroscopy, may be. used to discriminate normal 
and abnormal human tissues w vivo in the urinary bladder and the skin. Furthermore, 
measuring both fluorescence and diffuse reflectance spectra may provide additional 
information of diagnostic value. 

A system capable of measuring spatially . resolved reflectance spectra and 
fluorescence excitation emission matrices in v/vo.^would remove limitations of many 
previous studies, potentially enabling prediction of excitation wavelengths tiiat 
provide greatest discrimination of normal and ^bnoraial tissues, as well as a better 
understanding of the relative diagnostic abiUty of. changes in absorption, scanering 
and fluorescence properties of tissue. Although fiber optic systems to record : 
fluorescence EEMs and reflectance spectra at a single spatial location have been 
reported, such systems have measured data from only a single spatial location, and 
have thus not been able to perform spatially resolved spectroscopy. Additionally, 
previous systems have not been well-adapted for in-vivo studies of various tissues. 

SUMMARY OF THE TlvfVFNTTnN 

In one respect, the invention is an apparatus for performing fluorescence and 
spatially resolved reflectance spectroscopy on a sample, and it includes a light source. 
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a monochromator. a reHectance illumination fiber. a Ouorescence excitation fiber, an 
imaging spectrograph, a fluorescence collection fiber, a reflectance collection fiber, 
and a detector. The monochromator is in optical communication with the light source. 
The reflectance illumination fiber is in optical communication with the light source. 
The fluorescence excitation fiber is in optical communication witii the . 
monochromator. The fluorescence collection fiber is in optical communication with 
tiie imaging spectrograph. The reflectance collection fiber is in optical 
communication with the imaging spectrograph > and is in spaced relation with tiie 
reflectance iUumination fiber. The detector is, in optical communication with the 
imaging spectrograph. 

In other aspects, the light source may include a Xe arc lamp. The 
monochromator may include a double monochromator. The detector comprises a 
thermo-elecuically cooled CCD camera. The fluorescence excitation fiber and tiie 
fluorescence collection fiber may be integral,; ..One or more of Uie fibers may be 
positioned flush with Uie sample. The apparatus: may also include a spacer positioned 
between one or more of the fibers and tiie sample. The reflectance illumination fiber, 
tiie fluorescence excitation fiber, the fluorescence collection fiber, and the reflectance 
collection fiber may define a fiber optic probe. The probe may be configured to be 
positioned within a Urocar. The probe may include a, center section and an outer 
section, and tiie fluorescence excitation fiber and the fluorescence collection fiber may 
be positioned in the center section, and tiie reflectance iUumination fiber and tiie 
reflectance coUection fiber may be positioned in ,the; outer section. The apparams may 
mclude a plurality of fluorescence excitation and collection fibers arranged in a 
circular bundle. The apparatus niay include a plurality of reflectance collection fibers 
defining a plurality of collection positions. The. plurality of collection positions may 
be spaced between about 0 and about 10 milUmeters from tiie reflectance illumination 
fiber. The reflectance collection fiber may defme. a collection position at about 180 
degrees relative to the reflectance illumination fiber. The reflectance collection fiber 
may define a collection position at about 90 degrees relative to Uie reflectance 
iUumination fiber. The reflectance collection fiber may defme a collection position at 
about 45 degrees relative to tiie reflectance Ulumination fiber. The apparatus may 
include one or more fibers in optical communication with tiie light source and 
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configured to illuminate the sample during operation of the apparattis. The apparatus 
may include a plurality of fluorescence excitation fibers arranged in one or more rows 
adjacent the monochromator. The apparanis may include a plurality of fluorescence 
excitation fibers arid a plurality of reflectance coUection fibers arranged in a single 
row adjacent the imaging spectrograph. The apparatus may include one or more . 
unconnected fibers interspersed with the plurality of fluorescence excitation fibers and 
the plurality of reflectance coUection fibers. The apparatus may include a fiber 
connected from the light source to the imaging spectrograph to monitor spectral 
output of the light source. The apparatus may include a conU-oUer coupled to the 
detector. 

In another respect, the invention is an apparatus for measuring fluorescence 
and spatially resolved reflectance spectra of a sample. The apparatus includes a light 
source, a monochromator. a fiber optic probe, an imaging specu-ograph. and a 
detector. The monochromator is in optical communication with the light source. The 
fiber optic probe is in optical communication, with tiie light source and widi the 
monochromator. The probe includes a plurality; of . fluorescence excitation and 
collection fibers in spaced relation and a plurality ;of reflectance collection fibers in 
spaced relation with a reflectance illumination fiber. . The imaging spectrograph is in 
optical communication with the plurality of fluorescence collection fibers and with the 
plurality of reflectance collection fibers. The detector is in optical communication 
witii the imaging spectrograph. 

In otiier aspects, tiie plurality of reflectance collection fibers and the 
reflectance illumination fiber may be positioned concentrically about tiie plurality of 
fluorescence excitation and collection fibers.. .At least one of tiie plurality of 
reflectance collection fibers may define a collection position at about 180 degrees 
relative to tiie reflectance illumination fiber. At least one of tiie plurality of 
reflectance collection fibers may define a collection position at about 90 degrees 
relative to tiie reflectance iUumination fiber. At least one of tiie plurahty of 
reflectance coUection fibers may define a coUeetion position at about 45 degrees 
relative to tiie reflectance illumination fiber. Thepluiality of collection positions may 
be spaced between about 0 and about 10 millimeters from tiie reflectance illumination 
fiber. The probe may include between twenty-one and forty-six optical fibers. 



wo 99/57529 PCT/US99/09768 

5 

In another respect, the invention is a method for combined fluorescence and. 
spatiaUy resolved reflectance spectroscopy of a sample. The method includes 
direcung radiation to the sample with a fluorescence excitation fiber, collecting 
radiation from the sample with a fluorescence collection fiber, directing the radiation 
from the sample to an imaging spectrograph and a detector, illuminating the sample - 
with a reflectance iUumination fiber, collecting Reflected li^ht from Uie sample witii a 
reflectance collection fiber in spaced relation^ with Uie reflectance iUumination fiber, 
and directing tiie reflected light from tiie sampk to an imaging spectrograph and a 
detector. 

In other aspects, the step of collecting reflected light may include coUecting 
reflected Hght from a pluraUty of collection positions witii a plurality of reflectance 
collecuon fibers. The step of collecting reflected light may include coUecting 
reflected light from the sample witii a reflectance collection fiber defining a coUection 
position at about 180 degrees relative to the reflectance illumination fiber. The step of 
collecting reflected light may include collecting reflected light from tiie sample witi, a 
reflectance collection fiber defining a coUection position at about 90 degrees relative 
to tiie reflectance Ulumination fiber. The step of cpUecting reflected tight may include 
collecting reflected light from tiie sample witii a reflectance collection fiber defining a 
collection position at about 45 degrees relatiye to ; the reflectance iUumination fiber. 
The sample may include ovarian, head and neck; or cervical tissue. The metiiod may 
also include analyzing spectral data from the detQctor to characterize tiie sample. The 
step of analyzing may include pre-processing the data and reducing a dimension of tiie 
data using principal component analysis. The step of analyzing may also include 
selecting one or more diagnostic principal components of the data and forming one or 
more algoritiims. The step of analyzing may also include forming one or more 
composite algoritiims. The step of analyzing may also include evaluating at least on 
of tiie algorithms using a cross-validation technique. 

In another respect, the invention is a method for combined fluorescence and 
spatiaUy resolved reflectance spectroscopy of a sample. The metiiod includes 
directing radiation to die sample witii a fluorescence excitation fiber. coUecting 
radiation from tiie sample witii a fluorescence collection fiber, directing tiie radiation 
from die sample to an imaging spectrograph and a detector, illuminating tiie sample 
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with a reflectance illumination fiber, collecting reflected light at a plurality of 
collection positions from the sample with a plurality of reflectance collection fibers 
arranged in spaced relation, directing the reflected light from the sample to an imaging 
spectrograph and a detector to produce spectral data, pre-processing the data, and 
reducbg a dimension of the data using principal component analysis. 

The method may also include selecting one or more diagnostic principal 
components of the data and forming one or more algorithms. The method may also 
include forming one or more composite algorithms. The method may also include 
evaluating at least one of the algorithms using a cr()ss-validation technique. 

In another respect, the invention is a method for analyzing spectroscopy data to 
define an optimized reduced data set. The method includes pre-processing the 
spectroscopy data, reducing a dimension of the spectroscopy data using principal 
component analysis, and selecUng one or more diagnostic principal components of the 
spectroscopy data. 

In other aspects, the spectroscopy data may include combined fluorescence and 
spatially resolved reflectance spectroscopy data. The step of pre-processing may 
include normalization of the spectroscopy data/ The step of pre-processing may 
include mean scaling the spectroscopy data. The step of pre-processing may include 
calculating one or more derivatives on the spectroscopy data. The method may also 
include eliminating redundant data from the spectroscopy data. The method may also 
include forming one or more algorithms and evaluating at least one of the algorithms 
using a cross validation technique. The method inay also include forming one or more 
composite algorithms. 

Applications for the methods and apparatus described herein are vast and ^ 
include, but are not limited to. analysis and detection of disease including cancers and 
pre-cancers (such as cervical, head and neck, colon, lung, esophageal, ovarian) and 
atherosclerosis. Applications also include industry, including, but not limited to. the 
semiconductor industry. 
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The foUowing drawings form part of the present specificaaon 
to fiirther demonstrate certain aspects of the present invention. The invention may be 
better understood by reference to one or more of these drawings in combination witii 
the detailed description of specific embodiments presented herein. 

FIG. 1 Block diagram of a Fast EEM system according to one embodiment of 
the present disclosure. 

FIGS. 2A and 2B Probe output at 332 nm according to one embodiment of 
the present disclosure. 

FIG. 3 Inside of a light source according to one embodiment of the present 
disclosure. 

FIG. 4 Outside connectors of the Ught Source according to one embodiment 
of the present disclosure. 

FIGs. 5 A and SB Comparison between the monochromator and the spectral 
lamp output 

FIGs. 6A and 6B A probe according to the present disclosure showing 
fluorescence excitation fibers, fluorescence collection fibers, a quartz rod, a 
reflectance excitation fiber, and reflectance collection fibers. 

FIG. 7 Probe according to the present disclosure showing fluorescence fibers, 
a quartz rod. reflectance fibers, illumination fibers, a protection shield, and a quartz 
shield. 

nCS. 8A and 8C Tip of a probe according to tiie present disclosure showing 
illumination of i) reflectance ii) fluorescence and iii) Ulumination fibers. 

FIGS. 9A and 9B Monochromator and spectrograph connector with 
fluorescence and reflectance collection fibers according to one embodiment of the 
present disclosure. 

nG. 10 Probe including fiber connectors according to one embodiment of the 
present disclosure. Shown are visual illumination fiber 113. reflectance excitation 
fiber 1 15. fluorescence excitation fiber 1 17. and reflectance collection position 1 19. 

FIG. 11 Correction factors for the specti-ograph. 
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FIG. 12 Schematic of Binning techniques: On chip binning (left). On chip 
and software binning (right). 

FIG. 13 Main screen of a Fast-EEM user interface according tt) one 
embodiment of the present disclosure. 

FIG. 14 System block diagram showing a variable excitation light source, a . 
fiber optic delivery and collection probe, and a spectral mulrichannel analyzer 
according to one embodiment of the present disclosure. 

FIGS. ISA - 15D (left) Schematic diagram of the distal ends of the probe: 
[a] outer shaft, {b] fluorescence excitation and emission fibers, [c] reflectance 
collection and illumination fibers, [d] mixing element, [E] reflectance excitation fiber. 
[1-3] reflectance collection locations. (Right) Schematic diagram of the proximal ends 
of the probe. 

FIG. 16A Simulated HEM with peak shifting in [1] excitation wavelength [2] 
and emission wavelength. r 

FIGS. 16B-1 to 16B-6 Simulated EEM with peak shifting in [1] excitation 
wavelength [2] and emission wavelengtii. Calculated Xay and may for tiie simulated 
EEM, x>v is sensitive to changes in die excitation position of the peak and may is 
sensitive to the emission position. 

FIG. I7A EEM of Rhodamine standard solution. 

FIG. 17B EEM of an FAD and microspheres-based tissue phantom measured 
using a FastEEM system. 

FIGS. 18A - 18D (A) Emission spectra at 360 nm excitation of the 
Rhodamine calibration standard measured with the FastEEM system and SPEX 
Fluorolog n fluorimeter. (B) Emission specu-a at 360 nm excitation of the scattering 
tissue phantiiom containing FAD and polystyrene microspheres measured witii tiie 
FastEEM system and SPEX Huorolog D fluorimeter. (C) Emission spectra at 450 nm 
excitation of the Rhodamine calibration standard measured with tiie FastiEEM system 
and SPEX Huorolog fi fluorimeter. (D) Emission specU-a at 450 nm excitation of the 
scattering tissue phanthom containing FAD and polystyrene microspheres measured 
with tiie FastEEM system and SPEX Fluorolog H fluorimeter. 

FIGS. 19A and 19B In-vivo fluorescence measurements witii the FastEEM 
system: (A) Fluorescence EEM of a normal site of the tongue. (B) Fluorescence EEM 
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Of a diseased site of the tongue, containing a moderately differentiated squamous cell 
carcinoma. 

FIGS. 20A - 20G Fluorescence emission spectra of normal and moderately 
differentiated squamous cell carcinoma of the tongue from Figure 6. The spectra were 
normalized to the peak fluorescence at 350 nm excitation, (a) Fluorescence emission . 
spectra at 350 nm excitation, (b) Huorescence emission spectra at 410 nm excitation, 
(c) Fluorescence emission spectra at 460 nm excitiion. 

FIGS. 21A and 21B Emission and excitation autocorrelation vectors of 
normal and moderately differentiated squamous cell carbinoma of the tongue from 
HGS.IS. (A) Emission autocorrelation vectors. (B) Excitation autocorrelation vectors. 

FIGS. 22A - 22C. Reflectance measurements of normal and moderately 
differentiated squamous cell carcinoma of the tongue at tiiree different separations 
from the source fiber. (A) Position 1, 1.1 mm separation. (B) Position 2. 2.1 mm 
separation. (C) Position 3, 3 mm separation. > . ; , . , 

FIGs. 23A - 23C A schematic of the portable fluorimeter used to measure 
cervical tissue fluorescence spectra at three excitation wavelengtiis. 

FIG. 24 A schematic of formal analytical process used to develop the 
screening and diagnostic algoriUims. The text; in the dashed-line boxes represent 
mathematical steps implemented on the spectial data and the text in the solid line 
boxes represent outputs after each raatiiematical step (NS - normal squamous. NC - 
normal columnar, LG - LG SIL and HG - HG SIL). 

HGS. 25A - 25C (a) Original and correspondmg (b) normalized and (c) 
normalized, mean-scaled spectra at 337 nm excitation from a typical patient. 

FIGS. 26A - 26C (a) Original and corresponding (b) normalized and (c) 
normalized, mean-scaled spectra at 380 nm excitation from tiic same patient 

FIGS. 27A 27C (a) Original and corresponding (b) normalized and (c) 
normalized, mean-scaled spectra at 460 nm excitation from the same patient. 

FIG. 28 A plot of the posterior probability of belonging to the SJL category of 
all SILs and normal squamous epithelia from tiie. calibration set. Evaluation of tiie 
misclassified SILs indicates that one samples with GIN m. two with CIN II. two witii 
CIN I and two with HPV are incorrectiy classified. . 
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FIG. 29 A plot of the posterior probability of belonging to the SIL category of 
all STLs and normal columnar epithelia from the calibration data set. Evaluation of the 
misclassified SILs indicates that three samples with CIN E. three with CIN I and one 
with HPV are incorrectly classified. 

FIG. 30 A plot of the posterior probability of belonging to the HG SIL 
category of all SILs from the calibration set. Eyduation of the misclassified HG SILs 
indicates that three samples with CIN m and three with CIN are incorrectly classified 
as LG SILs; five samples with CIN I and two with HPV are misclassified as HG SIL. 

nCS. 31A - 310 Component loadings (CL) of diagnostic principal 
components of constituent algorithm (1), obtained from normalized spectra at (a) 337 
(b) 380 and (c) 460 nm excitation, respectively. 

FIGS. 32A - 32C Component loadings (CL) of diagnostic principal 
components of constiment algorithm (2). obtained from normalized, mean-scaled 
spectra at (a) 337 (b) 380 and (c) 460 nm excitaUon. respectively. 

FIGS. 33A - 33C Component loadings (CL) of diagnostic principal 
components of consfituent algorithm (3), obtained from normalized spectra at (a) 337 
(b) 380 and (c) 460 nm excitation, respectively. 

FIGS. 34A-34D Plots of Frequency of occurrence vs. emission wavelength 
in top 25 performing combinations of three wavelengths: (a) ESL=65%, (b) 
ESL=75%. (c) ESL=85%, and (d) ESI^95% 

FIG. 35 Fluorescence emission specu-a normalized by the peak intensity of 
the concatenated vector for all 62 sites at 350. 38Q and 400 nm excitation. Red lines 
indicate histologically cancerous, green Unes indicate histologicaUy dysplastic, and 
blue lines indicate visually and/or histologically normal sites. 

FIG. 36 Plot of the only eigenvector of diagnostic importance at ESL = 65% 
for wavelength combination (350 380 400) (lower line at vector index=200) and the 
corresponding component loading (upper line at vector index=200). 

FIG. 37 Plot of emission vector for a wavelength combination of three 
excitation wavelengths (350. 380. 400 nm) normalized by the peak intensity of each 
emission spectra. 

FIGS. 38A - 38C Refiectance spectra (A), first (B) and second derivation (C) 
for position one. 
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FIGS. 39A - 39C Reflectance specti-a (top), first (middle) and second 
derivation (bottom) for position two. 

FIGS. 40A - 40C Reflectance • spectra (top), first (middle) and second 
derivation (bottom) for position three. 

FIGS. 41 A - 41C Average reflectance spectra (top), first (middle) and second 
derivation (bottom) for position one. Error bars show standard deviation. 

FIGS. 42A - 42C Average reflectance spectra (top), first (middle) and second 
derivation (bottom) for position two. Error bars show standard deviation. 

FIGS. 43A - 43C Average reflectance spectra (tdp), first (middle) and second 
derivation (bottom) for position three. Error bars show standard deviation. 

FIGS. 44A - 44C p values comparing the mean intensity, mean first and 
second derivatives of normal tissue versus abnormal tissues, at source detector 
separation 1 (top), 2 (middle) and 3 (bottom). 

FIGS. 45A - 45C p values comparing , the mean intensity, mean first and 
second derivatives of normal tissue versus dysplasUc tissues, at source detector 
separation 1 (top), 2 (middle) and 3 (bottom). 

FIG. 46 Scatter plot of the second derivative at 430 nm for position 2 vs. the 
second derivaUve at 495 nm for position one. The straight line represents an algorithm 
to separate normal findings from dysplasias and cancers, and results in a sensitivity of 
80% and a specificity of 85%. 

FIG. 47 Scatter plot of the second derivative at 45.0 nm for position 1 vs. the 
first derivative at 510 nm for position three. The straight line represents an algorithm 
to separate normal findings from dysplasias and cancers, and results in a sensitivity of 
80% and a specificity of 82%. 

FIG. 48 Scatter plot of the second derivative at 410 nm for position 1 vs. the 
first derivative at 510 nm for position three. The straight line represents an algoritiun 
to separate normal findings from dysplasias and cancers, and results in a sensitivity of 
70% and a specificity of 75%. 



DESCRIPTI ON OF ILLUSTRATIVE EMBODIMENTS 

HG. 1 shows one embodiment of an apparatus 10 according to the present 
disclosure. The apparatus is adapted to measure boUi reflectance and fluorescence 
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data, and may be refeired to as a Fast-EEM system (where EEM stands for excitation 
emission matrix) system. Fast EEM system 10, in one embodiment, may include four 
main components, although those having skill in the art will recognize that more or 
fewer components may be udized: The compbiiehts are: (a) an excitation source 20, 
which may include an arc lamp 22 and a monochromator 24 for monochromatic and 
broad band excitation, (b) a fiber optic probe 30, which may be configured to deliver 
excitation light to and coUect remitted fluorescence from a sample 60, (c) a detection 
apparatus 40, which may include a filter wheeU an imaging spectrograph 42, and a 
CCD camera 44 and that spectrally resolves a collected signal, and (d) a control unit 
50, which may be a personal computer used to run Fast EEM system 10 and to acquire 
data. 

Excitation source 20 

The light source 22 for Fast EEM system 10, which may provide both quasi- 
monochromatic excitation for fluorescenc^ Md broad band iUumlnation for 
reflectance, may be. in one embodiment, a 150 W ozone free Xe arc lamp (Spectral 
Energy Corp., Westwood NJ) with a spherical rear reflector. 

A condenser system including two piano' convex quartz lenses may be used to 
couple light into a monochromator 24. With the benefit of the present disclosure, 
those having skill in the art will understand that any optical filter or device suitable for 
creating bandpass filtered light may be used for monochromator 24. In one 
embodiment, monochromator 24 may be a single monochromator. A manual shutter 
(not shown) may be located between condensing optics and monochromator 24 and 
may be closed to prevent fluorescence excitation light from reaching sample 60 during • 
reflectance measurements. The scanning speed of monochromator 24 may be. in one 
embodiment, about 10 nm/sec. Light may be coupled from tiie output slit of 
monochromator 24 into probe 30 via a fiber optic adapter (Spectral Energy. GMA 
257) (not shown) that includes a quartz plano-convex lens and a 5X quartz 
microscope objective. The light passing through the objective may be focused to an 
appropriate shape to fill one or more fibers of probe 30. In one embodiment, light 
passing through the objective may be focused onto a vertical line onto twenty-five 
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fibers of probe 30. the twenty-five fibers being arranged in two columns and placed at 
the focal plane of the objective (See FIG. 9A). 

A reflectance excitation fiber (See. e.g., FIG. 6) may be coupled to the lamp 
housing of light source 22 via a micropositioner (not shown). Broadband light exiting 
the lamp housing through an exiting hole may be coupled to a reflectance illumination 
fiber using a quartz plano-convex lens (NA=0.24). A five position illumination filter 
wheel (not shown) placed between the lamp and the lens may include three long pass 
filters with 50% transmission at 295 nm, 515 nrn and 715 nm, respectively. One of 
the filter positions may be blocked and may act as a shutter to prevent white light from 
reaching sample 60 during fluorescence measurements. 

In another embodiment, the light source 22 for Fast EEM system 10. which 
may provide both quasi-monochromaUc excitation for fluorescence and broad band 
illumination for reflectance, may be an ozone-free 450 W Xe arc lamp (FL-1007, 
Instruments S A, Edison, NJ). ,. j 

Light used for monochromatic fluorescence excitation may be focused with a 
spherical mirror (not shown) onto the input s\il of monochromator 24. In this 
embodiment, monochromator 24 may be a double monochromator (DDD 180, 
Instruments SA. Edison, NJ). A spherical rear reflector (not shown) may redirect light 
that is exiting the lamp in the opposite direction into the opposite direction onto the 
spherical mirror. The slit may be covered .\^ith a sapphire window, which may 
prevent hot air from flowing out of the lamp housing into the monochromator 24. A 
double monochromator may be chosen for monochromator 24 because of its higher 
stray light rejection compared to a single monochromator. A double monochromator 
may be configured in additive mode; which means Uiat the dispersions of tiie two 
holographic gratings are added. Stray light in such a configuration may be so slight as 
to be negligible. The focal length of each of the two monochromators may be about 
18 cm and the high throughput may be f/3.9. The two holographic gratings may have 
about 1200 grooves/mm and may be blazed at 500nm. In this embodiment, the 
system's maximal resolution may be about 0.3 nm witii an accuracy of about 0.5 nm. 
The scanning speed in tiiis emboidiment may be about 150nm/s, and tiie usable 
wavelengUi range may be from about 300 to about 1000 nm. Wavelength scanning 
may be achieved with a direct digital stepper-motor with a worm drive mechanism 
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(not shown). Three computer-controlled slits (entrance, middle, and exit) may be 
opened between 0 and 7 mm in steps of 12.5 ^im. In one embodiment, a slit-width of 
about 2 mm may be chosen for both the entrance and the exit slits. The middle slit 
twice may be opened as wide as the entrance and the exit slit to achieve an optimal 
performance. These settings guaranteed a spectral resolution of about 6 nm FWHM. 

no. 2 shows a spectrum taken at 332 nm by coupling light through probe 30 
through a fiber optic adapter into a scanning spectrofluorimeter (SPEX, Huorolog II, 
Edison, NJ). An emission scan from 300 nm to 600 nm was performed to collect the 
relative intensity of the probe output. 

In one embodiment, the coupling of light into a fluorescence excitation bundle 
(See. e.g., FIG. 6 and FIG. 7) was done using a fiber-optic interface kit (220F, 
histruments SA, Edison, NJ). Two plano-convex lenses (different focal lengths) may 
be matched to different NAs of the exit sUt and of a fiber bundle of probe 30 to 
minimize coupling losses. A computer-controlled i shutter (LS6, Vincent Associates, 
Rochester, NY) may be mounted in front of the probe connector to block fluorescence 
excitation light during reflectance measurements, • . 

Ught source 22 may be customized to prpvide \yhite Ught output. White light 
may be needed (a) for reflectance measuremems,; (b) , for visual observation of a 
measurement site by a physician, and (c) to monitor -the lamp output. 

no. 3 shows a top view drawing of the inside of the lamp housing according 
to one embodiment. Light bulb 25 and ray traces (dashed lines) for the 
monochromator light are shown. In one embodiment, the opUmal solution to provide 
white light output to the outside of the housing involved the use a bundle of quartz 
fibers. One biconvex lens, mounted in a custom-made rack inside the lamp housing, 
coupled light into a bundle of three 600 \im and one 50 ^un high-temperature quartz 
fibers (Thermocoat, Fiberguide Industries, Stirling, NJ). The light rays are indicated 
by the dotted line in FIG. 3. These fibers transported white light to four connectors on 
the outside of the housing (See HG. 4). The first connector CI may provide 
excitauon light used for reflectance measurements. The five-position illumination 
filter wheel described previously may be placed between two biconvex quartz lenses 
(focal length = 20 mm). The second connector G2 may be equipped with one quartz 
lens (focal length = 20 mm) that focuses light onto the illumination fiber bundle. A 
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second shutter (LS6. Vincent Associates, Rochester, NY) may be placed between the 
connector and the lens, .which may be closed during data acquisition and may 
otherwise be held open to deliver light to the illumination fibers of probe 30. The 
third 600 fiber output C3 may be used for other purposes, or not at all. The 50 m 
fiber output C4 may couple light into a fiber that is directly connected to imaging ■ 
spectrograph 42 to record the lamp spectrum for every measurement. In one 
embodiment, however, this option is not used. 

FIG. 5 illustrates the power output of two monochromatic illumination 
systems (one using a 150 W ozone free Xe arc lamp and the other using an ozone-free 
450 W Xe arc lamp). The output was measured through probe 30 using a calibrated 
power meter (818-UV. Newport, Irvine, CA) and represents the flux (W) that is 
provided to sample 60. which may be a tissue sample. Above about 400 nm, an 
improvement in power of a factor of four is noticeable. Note that the lamp performed 
poorly below 400 nm. The light output at about^sao nm is only about 20% of the peak 
performance at 460 nm. The low UV output miy be due to the fact that lamp is an 
ozone-free model. The light bulb is made out of UV blocking glass since Ozone is 
mainly produced in the surrounding air within this, spectral region. In order to have a 
useful S/N ratio prolonged exposure times in the Spectral region below 400 nm may 
become a necessity. 

Probe 30 

The combined spatial reflectance and fluorescence probe 30 of the present 
disclosure may be built to meet the foUowing criteria. First, the tissue volume probed 
by the reflectance and fluorescence measurements may overlap. Second, because the 
collected fluorescence intensity may be typically three orders of magnitude lower than 
the reflectance intensity, a detector with a high dynamic range may be required. 
Weakening the reflectance excitaUon light by using a smaller excitaUon fiber or using 
a number of fluorescence excitation fibers may. however, alleviate this problem. 
Third, the total diameter of the probe may be small enough so that it is possible to 
cover an area of only one tissue type; for example, dysplastic lesions around a tumor 
are likely to be only a few millimeters wide. Fmally, a probe 30 small in diameter 
may give the opportunity to use it for minimal invasive surgeries through trocars. 
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According to one embodiment, probe 30 may fit into a trocar. In one embodiment, it 
is designed to fit into a trocar (Reflex STR, 5 mm. Richard-Allan Inc.) that is 
commonly used in the Gynecology Department at The University of Texas M. D. 
Anderson Cancer Center, Houston, TX, (lir MD ACQ. 

One embodiment of a combined reflectance and fluorescence probe 30 ... 
includes a total of 21 quartz fibers (200 Mm core diameter, NA = 0.22). With the 
benefit of the present disclosure, however, those of skill' in the art will recognize that 
more or fewer fibers may be used. Additonally. although the present disclosure refers 
to embodiments of a probe including "fibers", it wUl be understood that any chamiel 
suitable for transmission of light may be substituted therewith. In one embodiment, a 
ring of twelve fluorescence collection fibers 70 surround a circle of seven 
fluorescence excitation fibers 72. In one embodiment (not shown), at least one 
fluorescence fiber may be an integral fluorescence excitation and collection fiber. At 
the distal end of fluorescence excitation and coUeetion fibers may be a quartz rod 
(about 1.5 mm diameter, about 7 mm thick) 74 located to ensure an overiap at the 
sample surface between fluorescence excitation and collection fibers. One reflectance 
excitation fiber 76 and one reflectance collection fiber 78 (both about 90 ^mi core 
diameter) may be placed outside of the quartz rod and flush to the sample, which may 
be tissue, on opposite sides. The reflectance fibers may be about 1.7 mm apart from 
each other, and light may be scattered through, the; same tissue volume that is 
examined for fluorescence. 

In one embodiment, a probe 30 may have a total length of about 28 cm to 
about 35 cm. which allows the probe to pass a trocar shaft. With the benefit of the 
present disclosure, however, those having skill in the art wUl recognize that the probe 
30. and other components described herein, may be made of different size (and 
materials) according to need or desire. 

Turning to FIG. 7 and FIG. 8. it may be seen that the diagnostic portion of 
probe 30 may include forty-six optical fibers (about 200 ^m, NA=0.22) in two 
concentric sections. With the benefit of the present disclosure, however, those of skill 
in the an wiU recognize that more or fewer fibers may be used. The center bundle 80 
(See no. 7) may contain twenty-five fluorescence excitation fibers and twelve 
fluorescence collection fibers. At the distal end of the probe 30. these fibers may be 
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arranged randomly in central bundle 80 and may be placed in mechanical contact with 
a short piece (about 1.5 cm long) of thick quartz fiber 82. Light sent through this rod 
may be distributed over an examined area. The rod's length may be determined by the 
radius of the rod and the NA of the fibers and may be calculated by taking twice the 
radius and dividing it by the fiber NA. 

Nine fibers for illumination and collection of diffuse reflectance may be 
arranged in a ring around the fluorescence fibers (See element 84, HG. 7). Three 
collection fibers 86 may be located at about 180°, two fibers 88 and 90 may be located 
at about 90°. and two fibers 92 and 98 may be located at about 45° from the 
iUuminaUon fiber 94. A single collection fiber 96 may be placed directly beside the 
reflectance excitation fiber in to measure single backscattered light. Fibers 92 and 98 
may have a distance to the excitation source of about 1.4 mm, fibers 88 and 90 of 
about 2.4 mm, and fibers 86 of about 3.3 mm. The distal ends of the reflectance 
fibers may be flush with the tip of the central .fiber and placed in contact with the 
sample surface. 

For measurements that take longer . thap . about 30 s, an optical feedback 
mechanism for the probe operator may need to be provided to avoid a displacement of 
the instrument. Therefore, a third ring of seven fibers 100, with an offset of about 2 
cm (for a 28 cm probe) and about 5 cm (for a 35,cm probe) from the tip may be added 
for Ulumination puiposes. Probe 30 may have a, screw-on protection shield 102 at the 
tip of the probe. Specularly reflected light bety/een a quartz shield 104 and the probe 
30. however, may lead to an uncorrectable biasing of the probe performance, and 
therefore protection shield 102 may optionally not be used. A 3 0-minute soaking of 
probe 30 in a disinfecting solution like Cidex™ (Johnson and Johnson IncO alloWs the 
probe to be used in the sterile environment of an operating room. 

The arrangement of fibers at the monochromator 24 and the spectrograph 42 
connectors, according to one embodiment, are shown in HG. 9 The fluorescence 
excitation fibers 108 may be arranged in two rows for opumally filling by a 
rectangular output beam of the monochromator 24. The fibers on the spectrograph 42 
end may be lined up in a single row, as shown. Fibers 1 10 are fluorescence coUection 
fibers, and fibers 1 12 (represented by darkened circles) are the reflectance collecUon 
fibers. Because saturauon in one fiber location may bloom to adjacent pixels on the 
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detector, additional spacing, realized by unconnected fibers (Ulustrated by un- 
darkened circles), reduced this problem. In this embodiment, the spectrograph 
connector contains fiber 114 that may be connected directly to a white light output of 
light source 22, which may be a Xe lamp, to monitor the spectral output of the light 
source over time. 

FIG. 10 iUustrates an entire probe 30, according to one embodiment, including 
connectors and connecting fibers. Note that reflectance coUection fiber 94 (See HG. 
8), the position right next to the excitation fiber, may be intemipted by disconnecting 
SMA connector #2. This feature was created in this embodiment in case tfie directiy 
backscattered light signal was too strong and needed attenuation. 
Spectrograph 42 and filter wheel 

Imaging spectrograph 42. in one embodiment, may be a commercial imaging 
spectrograph (Chromex 250 IS, Albuquerque, NM). A grating of about 100 
grodves/mm, blazed at about 450 nm may be used. With the benefit of the present 
disclosure, however, tiiose of skill in the art will understand that any optical filter or 
device suitable for analyzing spectral content of light from one or mutiiple sources 
simultaneously may be used for imaging spectrograph 42. 

Light collected by fluorescence and reflectance fibers and the excitation light 
guided directiy from the source may be coupled through an 8-position, computer 
controlled coUection filter wheel (Optomechanics Research. Inc., Vail, AZ), into 
imaging specU-ograph 42. The filter wheel blocks the fluorescence excitation light 
from entering the spectrograph 42. The specti-ograph may contain a holographic 
grating blazed at about 380 nm with about 100 ^ grooves/mm. The fibers may be ^ 
projected onto an entrance slit (about 250 jim) to yield a spectral resolution of about 
7nm. 

The non-uniform spectral response of the system may be corrected as shown in 
FIG. 11. These correction factors may be determined from measurements of 
calibration sources; in the visible, a N.I.S.T traceable tungsten ribbon filament lamp, 
and in die UV, a deuterium lamp may be used (550C and 45D, Optronic Laboratories 
Inc., Oriando, FL). 
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Variations in the intensity of fluorescence excitation light source at different 
excitation wavelengths may be corrected using measurements of the intensity at each 
excitation wavelength at the probe tip using a calibrated photodiode (818-UV. 
Newport). 

CCD Camera 44 

A Uiermo-electrically cooled CCD camera 44 (Specti-asource HPC-1. Westiake 
Village, CA) may be operated at about -30° C and may be located at the back focal 
plane of the imaging spectrograph 42. Chip dimensions may be about 13.8 x 9.2 mm 
with 1536 X 1024 pixels (Kodak KAF-1600 gi^de 2). to yield a nominal spectral range 
of about 410 nm for a single grating position. Each fiber may take up about 40 pixels. 
The dark currem of the CCD chip, in this cmbbdimeni. was specified and confirmed 
as 0.25 electrons/pixel/sec when operated at -30° C. Quantum efficiency of the 
lumogen-coated chip may range from a peak of about 40% at about 550 nm to a low 
of about 15% at about 250 nm. 
Binning Pixels 

The HPC-1 CCD camera 44 allows a user to perform on-chip binning of 
pixels. Binning means that neighboring pixels may be added togetiier to represent 
only one data point. This feature is atti-active for at least two reasons: (1) it allows a 
reduction in the time required to read data from the chip, and (2) it increases the 
signal-to-noise ratio by reducing the effective read out and shot noise. 

Although a useful feature, excessive binning may diminish the resolution of 
the system. Furtiiermpre, because tiie fu^ well capacity of tiie pixels and shift register 
is limited, it is possible to exceed this capacity by either grouping too many pixels 
together or by encountering an unexpectedly strong signal (blooming). When 
blooming occurs, charge in excess of tiie full well capacity of a capacitive element 
may spill into adjacent pixels. This can essentially fill tiie pixels witii charge and 
render them unavailable for signal detection or perhaps give a false indication of 
signal where none exists. 

In one embodiment, binning was only electronically implemented in tiie spatial 
direction on iht chip. The 12 fluorescence excitation fibers filled 480 pixels and were 
all binned togetiier. For the reflectance excitation, a combined binning in hardware 



wo 99/57529 PCT/US99A)9768 

20 ; 

and software was used in one embodiment. This technique had two advantages: ( 1) it 
increased the dynamic range compared to a full binning in hardware, and (2) it 
increased the data transfer rates as compared to non-binned data. HG. 12 shows the 
two different binning techniques. 

In one embodunent, the camera 44 and the readout electronics did not operate . 
in a rehable manner. Long-term testing showed that counts on every pixel can vary 
from exposure to exposure when the shutter remains closed. A DC offset variaUon on 
the chip, resulting in an average count of 700/pixel/s to 1500/pixel/s was monitored 
during a 12 hour period. The origin of this behavior was expected to be either a 
cooling problem of the CCD camera 42 or an unstable DC offset supplied to the A/D 
convener. In an attempt to cure at least some of the problems, a higher number of 
pixels were digitized that were actuaUy physically present. The count of these fake 
pixels reflected the DC offset of the signal and was found to be independent of the 
detector temperature. Testing showed that the count, of these fake pixels varied the 
same way as the real pixels did. In this embodiment, monitoring of the background 
was required at every single measurement, since a low fluorescence signal may lie in 
this range. The background could be subtracted from the acquired data. In the 
embodiment, another problem was discovered: with.th? readout of the chip. The first 
electronically binned line that was read out .was alwa>^. corrupted and had to be 
discharged. This meant that the double amount of pixels were binned into two 
columns from which the first corrupted one was dumped. 
Software and Control .'i - 

In one embodiment. National Instruments Labview Version 3.0 (Austin, TK) a . 
graphical programming development environmem based on the G (Graphic) 
programming language may be used to control F;^t EEM system 10. The platform for 
the control software may be any suitable control device or computer 50. In one 
embodiment, a laptop 486/75 MHz personal computer with docking stauon (Austin 
Inc., Austin. TX) was used as computer 50. Communication with the excitaUon 
monochromator may be provided via an RS-232 control module that is interfaced to 
the COM port of the docking station of computer 50. A camera control card may be 
mounted in the docking staUon. The imaging spectrograph 42 may be operated using 
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a National Instruments GPIB IEEE-488 board that is also located inside the docking 
station of computer 50. 

In another embodiment, a desktop computer chosen (Optipte^ 233GXa, 
Dell Computer Corporation, Round Rock, TX) ' equipped with a Windows95™ 
operating system as computer 50. All mentioned cards in this embodiment may be . . 
connected to Ae ISA-bus of computer 50. A double monochromator 24 and 
spectrograph 42 controls may be connected by a GPIB IEEE-488 interface (AT- 
GPm/TNT, National Instalments. Austin, TX): The two shutters and the filter wheel 
may be controlled with a digital I/O card (PC^pi6-24. National Instruments. Austin. 
TX). The CCD camera 44 may have its own ISA-bus interface card. The readout rate 
of the chip in this embodiment was greater than about 65,000 pixels/s. This gave a 
readout time of about 24 s for the whole chip if no binning was used. In this 
embodiment, no on board RAM was available to buffer acquired data. 

In one embodiment. Labview V.5.0 (National .Instruments, Austin. TX) was 
chosen as the software to control the entire Fast EEM system 10. In tiiis embodiment, 
the goal of software developmem was to create aii easy to use interface tiiat made the 
system conu-ollable by an operator with basic computer knowledge after only a few 
days of training. 

Such software may be designed using a smaU number of basic sub-Vi's (Vi: 
virtual instmment. National Instruments' expression for software units). Operator 
interaction may be minimized to avoid human errors.; Automation of file saving and 
auto-naming of saved files may be implemented tO prevent loss of data by mislabeling 
or accidentally overwriting certain files. Such, automation may also speed up the 
interaction time of an operator with the software between ineasu^ 

In one embodiment, stored fluorescence .data was loaded immediately after 
storage and could be visually inspected in the ceoter of the screen. Such a routine may 
be added as a quality-ensuring featiire. and it may also^ help to prevent data loss caused 
by saving errors or misahgnment of Uie system if tiie operator was experienced in 
interpreting tiie acquired data. 
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Software Structure 

HG. 1 3 shows a main user interface according to one embodiment from which 
the Fast EEM system 10 may be controlled. With the benefit of the present 
disclosure, those having skill in the art will understand that there are numerous ways 
in which system 10 may be controlled and that the interface shown in HG. 13 is but 
only one of those ways. Other user interfaces may be implemented as is known in the 
art. In HG. 13. the center displays show four spectra of the last fluorescence 
measurement (top graph) and the acquired reflectance data (bottom). The excitation 
wavelengths of the displayed spectra may be changed online. Around this screen, 
different buttons may be present, which allow access to the certain main features. 

In the configuration component of the software interface illustrated in HG. 1 3, 
all the configuraUons were accessible and controllable. In the 'Savmg parameter' sub 
program, a patient number and the directory path may be defined. The integration 
time for the individual exposures and the settings of the CCD camera 44 may be 
stored in the corresponding subroutine. The Spectrograph setUngs may be changed in 
the 'Chromex'-Vi. The buttons for the mercury calibration, the lamp monitoring, and 
the power output of the probe may also be associated with the configuration settings 
of the software. 

In regard to acquiring date, individual switches for starting the background and 
the standards measurements may be placed on the left side of the spectra display. The 
fluorescence, reflectance and combined reflectance and fluorescence measurements 
may be initiated in the 'Main Measurements* box, . Naming of files with the acquired 
data may be dependent on which kind of measurerrient is chosen. In one embodiment, 
no manual naming of files by the operator was necessa^. 

Many additional features may be added to |the software and user interface. For 
example, an image of the whole CCD chip with all possible settings and binnings may 
be achieved. The monochromator 24 may be moyed to any desired wavelength. The 
center wavelength of the spectrograph 42 may be set manually, too. The camera's 44 
exposure time may be adjusted, and it may possible to choose if the shutter of the 
spectrograph 42 should open or if it should remain closed to image the dark current. 
AnoAer sub Vi may be designed to change all the settings of the monochromator 24. 
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such as wavelength, and slit width. Emission and reflectance spectra may be loaded 
and visuaUy compared on the screen. It may be possible to turn on and off the probe's 
iUumination light from the main screen. It shall be understood that none of these extra 
features need influence the settings for the main measurements. Default values may 
always be restored when measurements are started. When exiting the software, a 
protocol file may be created that contains all the important settings, the date, file 
names and the name of the operator. In one embodiment, about 1 12 individual Vi's 
were created to design a reliable, easy-to-use and fault-proof system, although it will 
be understood that more or fewer routines may be implemented according to the needs 
or desires of the user. In other embodiments, for instance, a simpler or more 
complicated user interface may be easily implemented as is known in the art. 
Temporal Performance 

Table 2.1 compares the temporal performance of the two embodunents of Fast 
EEM systems described above - one utilizing a 150 W ozone free Xe arc lamp, single 
monochromator. and twenty-one fiber probe (Embodiment A); and the other system 
using a 450 W ozone free Xe arc lamp, a double monochromator. and a forty-six fiber 
probe (Embodiment B). Overall, the time to obtain a complete EEM in Embodiment 
B between 330 nm and 500 nm excitation in steps of 10 nm was cut down to less than 
45 s. a temporal improvement of 105 seconds over Embodiment A. To obtain the 
same amount of counts on the CCD chip, the exjiosurc times may be cut down from 
1500 ms to 200 ms. depending on the excitation wavelength. An exposure time of 
375 ms may be expected since the amount of light delivered to the tissue may increase 
by a factor of 4. The alignment on the emission side was improved in Embodiment B. 
so that the throughput was almost twice as much as before. The monochromator' s 
scanning speed may be decreased from 34 s for an entire scan and resetting to the 
starting wavelength to less than 3 s. A faster computer and the use of a 32-bit 
operating system in Embodiment B cut down the computation time by almost 50%. 
However, it still required about 2 s per exposure to transfer the data from tiie camera 
to tiie computer. This value adds up to 42 s. 75% of the whole data acquisition time. 
This handicap may be further improved by replacing the readout electronics of the 
CCD chip. The control of the illumination shutter, a new feature of the system, did 
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not add any extra time to the measurements. The shutter opened and closed in less 
than 5 ms. 

In embodiment B, reflectance; measurements may be sped up by u^^ 
200 ^m fiber for the excitation light instead of a 80 ^m fiber, since more Ught is 
5 provided to the sample 60, which may be a tissue. A nidre intense white Ught output 
of the system may serve the same purpose. By using a different imaging specu-ograph 
42 with a grating with lower spectral dispersion, a wider spectrum may be covered on 
the CCD chip. To cover the desired spectral range for reflectance measurements, only 
two (instead of three) sub-range exposures may bei necessary. OveraU data acquisition 
10 time over 2 wavelength ranges and four positions may be achieved in 31 s m 
Embodiment B, which is about three times faster than that in Embodiment A, in 
which only 3 spatial positions had to be exposed. 



wo 99/57529 



25 



PCT/US99/09768 



Table 2. 1 Comparison of Temporal Performance: 





Embodiment A 


CimDOuinient o 


Fluorescence 






Scanning time: 

2 X 500 nm - 330 nm 


2x 170nm/10nm/s=34s 


2 X 70 nm / 150 nm/s =2.7 s 


Exposure time 


18x 1.5 s = 27s 


20 exposures: 

£ = 6.0 s (see 3.1.2) 


Moving filter wheel 


8 X 1 s = 8 s 


8xls=8s 


Camera shutter, data 
transport 


18x4.5 s = 81 s 


21x2s = 42s 


illumination shutter 




< 1 s 




Z = 150 s 


Z = 53.7 s 


Reflectance 






Exposure time 


9 exposures: 27 s 


8 exposures: 6 s 


Camera shutter, data 
transport 


63 s 


25 s 




Z=90s 


Z=31s 



In summary, a combined reflectance and fluorescence measurement with the 
Embodiment B may be obtained in 85 s. about three times faster than with the 
Embodiment A. This temporal improvement may benefit the patient and may also 
minimize the chance that the physician moves the probe during measurements. 

The following examples are included to demonstrate preferred embodunents 
of the invention. It should be appreciated by those of skill in the art that the 
techniques disclosed in the examples which follow represent techniques discovered by 
the inventor to fiinction well in the practice of the invention, and thus can be 
considered to constitute preferred modes for its practice. However, those of skill in 
the art should, in light of the present disclosure, appreciate that many changes can be 
made in the specific embodunents which are disclosed and still obtain a like or similar 
result without departing from the spirit and scope of the invention. 
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EXAMPLE 1 

Fluorescence Excitation Emission Matrices of Human Tissue: A System for In vivo 
Measurement and Method of Data Analysis 

This example describes a Fast EEM system capable of measuring spatiaUy 
resolved reflectance spectra from 380-950 mn and fluorescence excitation emission 
matrices from 330-500 nm excitation and 380-700 nm emission in vivo. System 
perfonnance was compared to a standard scanning, spectrofluorimeter. This FastEEM 
system was used to interrogate human normal and neoplastic oral cavity mucosa in 
vivo. Measurements were made through a fiber optic probe and required about 4 
minutes total measuremem time. This example also presents a method based on 
autocorrelation vectors to identify excitation and emission wavelengths where the 
spectra of nonnal and pathologic tissues differ most. The FastEEM system provides a 
tool with which to study the relative diagnostic ability of changes in absoiption, 
scattering and fluorescence properties of samples, including tissue samples. 
Materials and Methods: 

FIG. 14 iUustrates a block diagram of a Fast EEM system 10 in accordance 
with the present disclosure. This system includes at least three main components: (1) 
an arc lamp 22. stepper motor driven monochromator 24 and filter wheel, which 
provides monochromaUc and broad band excitation. (2) a fiber opUc probe 30 which 
directs excitation light to the sample 60. whidi riiay be a tissue sample, and collects 
remitted fluorescence from, in this embodiment, one location and diffusely reflected 
light from, in this embodiment, three locations, and (3) a filter wheel, imaging 
spectrograph 42 and CCD camera 44 which detects the specially resolved reflectance 
and fluorescence signals. Excitation monochromator position, filter wheel position, 
spectrograph grating position, CCD operation and data acquisition are controlled 
using a laptop personal computer 50 mated to a docking station. The specifications of 
each sub-system are described below. 

The probe 30. illustrated in FIG. 15. included a total of forty-six optical fibers 
(200 ^m diameter. NA=0.2) arranged in two concentric bundles. The center bundle 
contained twenty-five fluorescence excitation fibers and twelve fluorescence 
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collection fibers. The proximal ends of the fluorescence excitation fibers were 
arranged in two vertical lines at the exit slit of the excitation monochromator 24 to 
maximize the couphng of the Ught into the sample. The proximal ends of the 
fluorescence collection fibers were arranged in a single vertical Une at 
of the imaging spectrograph 42. At the distal brid of the probe 30. the fibers that 
excite and collect fluorescence were arranged randomly in a central bundle and placed 
in contact with a short piece of a thick quartz fiber (2 mm diameter. 15 mm long, 
NA=0.2). The distal tip of this fiber was placed in contact with the sample surface 60.' 
and ensured that the area from which fluorescerice was collected was the same as that 
directly illuminated. 

The nine fibers for illuminaUon and collection of diffuse reflectance were 
arranged in a concentric ring around the thick quartz fluorescence measurement fiber. 
The distal ends of these fibers were flush with the tip of the central fiber and were 
placed in contact with tiie sample surface 60. White light from a port on the side of 
the lamp housing was coupled to the proximal end of a single illumination fiber (80 
Vm, NA 0.2). Photons that scatter tiirough the, tissue and exit tiie surface were 
collected at four different positions with seven collection fibers; three located 180° 
from the illummation fiber (3 mm distance), two located 90° from the illumination 
fiber (2.1 mm) and two located 45° from the aiumination fiber as shown (1.1 mm) 
(See HG. 15). The proxhnal ends of the reflectance collection fibers were situated at 
the top of tiie vertical Une of fluorescence coUection fibers, separated by dummy 
fibers as shown in FIG. 15. 

The light source 22 for the instrument, which provided both quasi- 
monochromatic excitation for fluorescence arid broad band illumination for 
reflectance, was a 150 W ozone free Xe arc larpp. (Spectral Energy Corp.. Westwood 
NJ) with a spherical rear reflector. A condenser system consisting of two plano- 
convex quartz lenses was used to couple light into monochromator 24. The primary 
condenser was 1.5 inches in diameter witii an aperture ratio of f/1.5. The secondary 
condenser was also 1.5 inches in diameter, but was masked to provide numerical 
aperture matching to the monochromator 24. A manual shutter was located between 
the condensmg optics and monochromator 24 and was closed to prevent fluorescence 
excitation hght from reachmg the sample 60 during reflectance measurements. The 
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monochromator 24 had an apenure ratio of f/3.6 (Spectral Energy. GM 252) and was 
used with an ion-etched holographic graUng (ISA. Edison. NJ. 240 nm blaze. 1180 
groovesAiun. dispersion = 3.3 nmAnm). An RS-^32 controlled stepper motor drove 
the monochromator 24 with a maximum stepping rate of about 400 step/sec (about 10 
nm/sec). A bandwidth of 6.6 nm was selected by setting the entrance slit of the 
monochromator to about 2.0 mm. Ught was cpupled from the monochromator 24 
into the probe 30 via a fiber optic adapter (Spectral Energy, GMA 257) consisting of a 
quartz plano-convex lens and a 5X quartz microscope objective. The light passing 
through the objective was focused onto a vertical line of 25 fibers in two columns, 
placed at the focal plane of the objective (See HG. 15). The reflectance excitation 
fiber was attached to the lamp housing via a micropositioner. Broadband light exiting 
the lamp housing tiirough an existing hole was coupled to the reflectance illumination 
fiber using a quartz plano-convex lens (NA=0.24). A five position illumination filter 
wheel placed between tiie lamp and the lens contained three long pass filters with 50% , 
transmission at 295 mn. 515 mn and 715 nm. Qne of tiie, filter positions was blocked 
and acted as a shutter to prevent white light from reaching the sample during 
fluorescence measurements. ■ i .^ . 

Ught collected by fluorescence and reflectance fibers was coupled through an 
8 position, computer conUroUed collection filter .wheel, into a Chromex 250 IS 
(Albuquerque. NM) imaging spectrograph 42 containing a holographic grating blazed 
at 380 nm with 150 grooves/mm and a reciprocal linear dispersion (RLD) of 20 
nm/mm. The fibers were pnyected onto an entrance sHt (25Q pm) which yielded a 
spectral resolution of about 5 mn. A thermo-electrically cooled CCD camera 44 
operated at about -30" C (Spectrasouice HPC-l. Westlake VUlage. C A) was located at 
Uie back focal plane of the imaging spectrograph 42. Chip dimensions were 13.8 x 
9.2 mm witii 1536 x 1024 pixels (Kodak KAF-J600 grade 2). yielding a nominal 
spectral range of about 276 nm for a single grating position. Dark current was 
specified as 0.25 electi-ons/pixel/sec when operated at -30" C. The quantum efficiency 
of the lumogen coated chip ranged from a peak of 40% at 550 mn to a low of 15% at 
250 nm. 

The detector and imaging spectrograph . were wavelength calibrated by 
measuring the room light spectra Uiat showed Uiree Mercury, peaks at 404.7. 436 and 
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546 nm. The relauon between pixels and wavelength was then linearly fitted through 
these points. 

Fluorescence and reflectance measurementkwere^^ Prior 
to fluorescence measurements, the white Ught port was closed and pixels illun^^ 
by the fluorescence fibers were selected to be read from the CCD 44. Dark current . 
and A/D conversion offset was measured with, the same setting as the subsequent 
measurement but with a closed camera shutter. These were subtracted from all 
fluorescence and reflectance measurements. The first excitation wavelength was 
selected by scanning the excitation monochromator. the emission fUter wheel was 
rotated to select the appropriate long pass filter and the spectrograph graUng was 
adjusted to record signal over the desh-ed emission wavelength range. The 
monochromator 24 and camera shutters were then opened for the desired exposure 
Ume to record the fluorescence emission spectrum (1.5 seconds). The excitation 
wavelength was then incremented, and the process repeated until all desired excitaUon 
wavelengths have been measured. The excitation wavelengths were incremented from 
330 to 500 nm in 10 nm steps. Table 1 contain§; .a list of the excitation wavelengths 
and corresponding long pass filters and emission wavelength ranges used in this 
Example. 

Following collection of fluorescence spectra, diffuse reflectance spectra were 
then measured. For these measurements, the monochromator shutter was closed, the 
emission filter wheel was set to the lowest filter position and the pixels illuminated by 
the corresponding reflectance collection fibers We^e selected to be read from the CCD 
44. Dark current and A/D conversion offsets were measiH^ and stored for subtracUon 
of the following measurements. The reflectance spectrum was collected over three 
illumination wavelength ranges. Prior to measurement of each range, the appropriate 
long-pass filter was selected in the illumination filter wheel, and the spectrograph 
grating was adjusted to record signal over the desired wavelength range. The lamp and 
camera shutters were then opened for the desired exposure time to record the 
reflectance spectrum (0.4 - 4.8 seconds). The illumination wavelength range was then 
incremented, and the process repeated until all desired wavelength ranges have been 
measured. Exposure times were determined empirically to achieve a signal to noise 
ratio greater than 20. Table 1 contains a Ust of the illumination wavelength ranges 



wo 99/57529 PCT/US99/09768 

30 ■ 

and corresponding long pass filters used for diffuse reflectance measurements. The 
high dynamic range of the reflectance measurements, spanning over three orders of 
magnitude, required that each spatial position be read out individually from the CCD 
44. this prevented saturation and blooming artifacts. 

There are no accepted safety standards for illumination of mucosal surfaces . 
other than skin and cornea. However, the exposure of solar radiation that is 
equivalent to the exposure received when a measurement is made with this system has 
been calculated. The method compares the spectral iiradiance [W/cm" nm] of the 
excitation source with solar irradiance data obtained ftom [NSF Polar Programs UV 
Spectroradiometer Network 1994-1995 Operations Report; NSF UV Radiation 
Monitoring Network 1994 to 1995 Volume 5.0 Data Set. Available at 
WWW.BI0SPHERICAL.COM.]. The comparison includes a point-wise division of 
the irradiance from the FastEEM system to the solar iiradiance at the same 
wavelength. This ratio gives a relative solar exposure factor. The solar data is for a 
sunny day in San Diego. California. Irradiation during fluorescence excitation is less 
tiian 7 times solar exposure at all wavelengUis, Given that fluorescence excitation 
times were 1.5 seconds. Uiis corresponds to exposure to solar radiation for less tiian 1 1 
seconds in any given wavelength band. During diffuse reflectance measurements, the 
lamp exposure is maximum at 300 nm, where the relative exposure is a factor of 25 
tiiat of the sun. Since the total exposure time for this wavelength band is 14 seconds, 
tiie exposure corresponds to 350 seconds or less Uian 6 minutes. All otiier 
wavelengths have relative exposure factors of IQ or less resulting in a shorter 
equivalent total solar exposure. 

Prior to every patient measurement tiie probe output was measured witii a 
calibrated power meter (Newport. Irvine. CA, 818-UV) at 400 nm excitation 
wavelengtii. An average output of 86 ^W +/- 12 ^iW was achieved at this wavelengtii 
with a bandwidth of 6.6 nm. Background fluorescence spectra were measured witii 
the probe dipped in a non-fluorescent botde containing distilled water. This 
background HEM was subtracted from all subsequentiy acquired EEMs to conect for 
room lights and probe autofluorescence. The non-uniform spectral response of Uie 
system was corrected using correction factors determined from measurements of 
caUbration sources; in the visible a N.I.S.T traceable tungsten ribbon filament lamp 
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and in the UV a deuterium lamp were used (SSOC and 45D, Optronic Laboratories 
Inc., Orlando, FL). Variations in the intensity of fluorescence excitation light source 
at different excitation wavelengths were corrected using measureinents of the intensity 
at each excitation wavelength at the probe tip usjing; a calibrated photodiode (818-UV. 
Newport). Background spectra to correct reflectance measurernents for room light 
contributions were measured with all parameters set as for tissue measurements 
except the white light shutter was closed. These measurements were subtracted from 
all subsequent reflectance spectra. 

Huorescence and reflectance standards were measured before each patient 
measurement. The fluorescence intensity was reported relative to tiie fluorescence 
intensity of a solution of 2 mg/L Rhodamine 610 (Exciton. Dayton, OH) in ethylene 
glycol at 460 nm excitation and 580 nm emission. Reflectance data are reported 
relative a 2.68% by volume solution of 1.072 micron diameter polystyrene 
microspheres (Polyscience Inc.. Warrington. PA),;The ipicrosphere standard was used 
for its well-characterized optical properties. Th^ total integrated reflectance of this 
standard was measured on a double beam specti:ophotometer (U-3300 Hitachi. Tokyo. 
Japan) with an integrating sphere attachment (La^sphere lnc, North Sutton. NH). This 
was used to correct the reflectance standard me^urements made witii the FastEEM 
system. Tissue spectra at each collection fiber position were divided pointwise by the 
corrected standard reflectance spectrum at the cori-esponding fiber position. 

The EEMs were assembled offline from each series of fluorescence emission 
scans. Data processing and plotting were performed with Matiab. (The Math Works 
Inc.. Natick. MA). Reflectance spectra were assembled, from tiiree wavelength areas 
giving a range from 380 to 950 nm. The wavelengtii range was further reduced (380 - 
800 nm) to comply with the range of calibration measureinents of the reflectance 
standards on the U-3300. Reflectance data were reported between 380 and 595nm. a 
range where the possible influence of room lights in the measurement was minimized. 
System Validation 

System performance was assessed using two fluorescence standards. The first 
standard was a 2 mg/L Rhodamine 610 (Exciton Inc.. Dayton, OH) ethylene glycol 
solution tiiat is non-scattering, but has peak fluorescence intensity approximately 
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twice the average intensity of human cervix. TTie second standard mimics the optical 
properties of tissue and consists of 20 nM Havin Adenine Dinucleotide (FAD. Kodak, 
Rochester, NY), G.625 vol% polystyrene micro spheres (Polyscience Inc., diameter = 
1.072 urn). 

Both standards were measured with the FastEEM system 10 and a scanning 
spectrofluorimeter (SPEX, Huorolog H. Edison, NJ). The EEMs measured with the 
SPEX were considered as standards since the performance of the system is well 
documented (dynamic range=10^, spectral resolution 5" nm, corrected for non-uniform 
spectral response). The excitation light was incident perpendicular to the sampling 
cuvette and the emitted light was collected at approximately a 20 degree angle with 
respect to excitation light. A front focus arrangement with a 10 mm cuvette was used 
in the SPEX. 60 minutes were required to collect a full EEM from each sample with 
the SPEX. 

Clinical Studies ■ . ,v..\ . . . . . 

In vivo data were obtained from a group of patients with a known or suspected 
premalignant or malignant lesions of the oral cavity. The studies were reviewed and 
approved by the Internal Review Board of the University of Texas at Austin and the 
Surveillance Committee at the UT MD Anderson Cancer Center (Houston). Informed 
consent was obtained from each person in the study. Before using the probe, it was 
disinfected with Metricide (Metrex Research Corp.) in accordance with the standard 
clinical protocol. Background fluorescence EEM and reflectance spectra were 
measured by dipping the fiber optic probe in a' non-flubrescent bottle filled with 
deionized water. These EEMs and spectra correspond to the system autofluorescence, . 
and were subtracted from all subsequently acquired EEMs for that patient. Next an 
EEM was measured from a Rhodamine calibration standard and a reflectance 
spectrum was measured from a polystyrene soluUon calibration standard. The probe 
was then guided to the tissue site to be examined and its dp positioned flush with the 
tissue. A fluorescence EEM and reflectance spectra were obtained from sites within a 
lesion and a clinically normal site. Post-spectroscopy, a 2-4 mm biopsy of the tissue 
was taken from normal and abnormal sites where the probe measured spectra. These 
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specimens were evaluated by an experienced pathologist, Bonnie Kemp. M.D.. using 
light microscopy and classified using standard diagnostic criteria. 
Data Analysis 

One of the goals of the Fast EEM instrument 10 is to provide information for 
the identificaUon of excitation wavelengths suitable for the differentiation of tissue of ' 
differing pathological characteristics, as well as identification of the chromophores 
responsible for the differences. While all such information is present in the EEMs 
collected, it can be difficult to extract due to tiie dimensionality of the data set. A 
method was devised to separately characterize the excitation and emission 
characteristics of the data set. 

Given that Uie EEM has dimensions corresponding to (X,. the following 
autocorrelation vectors are defined: 

x« ) = Z>em(>.„x„ J eem(a., .X„ j 

"-(^n) = 2!,EEM^.,Aj.EEM^,,.;^„) :; 

where xM is the excitation autocorrelation vector and is the emission 

autocorrelation vector. Essentially, the emission autocorrelation vector is the diagonal 
of the product of the EEM with its transpose, and the excitation autocorrelation vector 
is the diagonal of tiie product of the transpose of thfe EEM with the EEM. Note that in 
signal processing terms, the autocorrelation vectors; x.v and m^v. are a measure of Uie 
average signal of tiie EEM at each excitation or emission wavelength, respectively. In 
this way tiiey provide qualitative information about aiid EEWi. 

An example with simulated data is presented in FIGS. 16A and 16B to 
iUustrate how autocorrelation Vectors reflect changes in fluorescence peak positions in 
EEMs. Two kinds of changes are simulated in tiie modeled data: a shift in tiie 
excitation wavelength at which a fluorescence peak appears, and a shift in the 
emission wavelength at which a fluorescence peak appears. The original peak in the 
EEM was modeled as a single gaussian at 380 nm excitation. 550 nm emission witfi a 
FWHM of 35 nm in emission and excitation wavelengths. The original peak was then 
shifted by 30 nm in excitation as shown by arrow 1 in HG. 16 A. The shift in 
emission wavelength is shown by arrow 2 in HG. 16A. and corresponds to a 30 nm 
shift in tiie emission peak of tiie original data. Three sets of autocorrelation vectors 
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were computed: one for the EEM with the original peak, one for the EEM with the 
excitation wavelength-shifted peak, and one for the EEM with the emission 
wavelength-shifted peak. The autocorrelation vectors are shown in HQ. 16B. 
Comparing the vectors for the original EEM (row 1 in HG. 16B) with the vectors 
from the EEM with the excitation wavelength-shifted EEM (row 2 in HG. 16B), it is 
seen that the excitation autocorrelation vector is sensitive to the change in excitation 
wavelength but not in emission wavelength. Similarly, comparing the autocorrelaUon 
vectors for the original EEM with the vectors from the EEM with the emission 
wavelength shift in the peak (row 3 in HG. 16B) shows that the emission 
autocorrelation vector is sensitive to the changes in emission wavelength but not 
excitation wavelength. 

It is sometimes desirable to normalize the autocorrelation vectors to facilitate 
comparisons between different sets of measurements. Normalized autocorrelation 
vectors have been calculated by dividing these vectors by their RMS value, in effect 
forcing the area of the vector to one unit of signal energy. The normalized emission 
autocorrelation vector is well suited for the identification of differential features in 
EEMs, such as the shifting or broadening of fluorescence peaks. 
Results and Discussion: 

FIGS. 17A and 17B show fluorescence EEMs of the non-scattering 
Rhodamine standard and the scattering FAD phantom obtained with a FastEEM 
system 10. Intensities are reported relative the Rhodamine intensity measured at 460 
nm excitation and 580 nm emission wavelength. HGS. 18A and 18B show 
fluorescence emission spectra of the Rhodamine standard obtained at 370 and 450 nm 
excitation with the SPEX and the FastEEM system 10 as well as the fluorescence 
background. FIG. 18B and 18D show the same spectra for scattering FAD phantom 
obtained at the same excitation wavelengths. The spectra are normalized at thek 
maximum. Note the presence of Rayleigh scattering peaks from the excitation source 
in the data taken with the SPEX. In general, from non-scattering samples (HG. 18A. 
18C) the FastEEM system 10 collects less light above 600 nm than the SPEX. This 
may be due to the different collection efficiencies of the FastEEM probe and the front 
face collection geometry of the SPEX. Under scattering conditions and with lower 
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Huorescence signal, the influence of background fluorescence becomes more critical. 
At 370 nm excitation wavelength the FastEEM system l6 measures more fluorescence 
below 500 nm. A comparison with the measiired fluorescence background however 
shows that the additional signal has the same shape as the background. It has been 
hypothesized that the background may have been underestimated by measuring it in a 
non-scattering non-fluorescent media. 

In-vivo fluorescence EEMs of the oral cavity were measured from 71 sites and 
in-vivo reflectance specu-a were measured from 49 sites, these were obtained from 
patients in two smdies. The first study included patients with abnormal oral lesions 
identified in a previous medical examination (17 patients). The second smdy, 
contributing nine patients, was of normal volunteers. All sites interrogated 
spectroscopically in paUents with lesions were biopsied and submitted for 
histopathological analysis. Spectra and biopsies were also obtained from a 
contralateral site with no lesion in these patients with abnormal lesions. These 
biopsies were also evaluated histppathologically. No biopsies were taken from the 
normal volunteers. In this Example, the invehtors show representative EEMs from 
tissue found to be histopathologically normal . Md malignant to iUustrate spectral 
features detectable with the FastEEM system. ; , . . . , 

Two EEM contour plots from a normal and an abnormal area of the tongue are 
presented in HGS. 19A and 19B, respectively. In the normal sample, fluorescence is 
observed throughout the whole collection range,, with a peak located at 330/380 
(excitation/emission) and a ridge extending from 340/450 to 450/500. Table H lists 
excitation.emission maxima pairs of endogenous tissue chromophores. Comparison 
of the observed peaks with Table H shows these peaks are consistent with the 
emission of structural proteins such as collagen and elastin, pyridine nucleotides 
(NADH) and fiavoproteins (FAD). The noimal site shows overall increased 
fluorescence with respect to the abnormal site shown in FIG. 19B. The abnormal site, 
assessed by a pathologist as being moderately differentiated squamous cell carcinoma, 
also shows broad fluorescence throughout. Peaks are. observed at 330/380, 350/460, 
460/520 and 500/630. A valley is seen at 420. nm excitation between 560 and 580 
emission. This vaUey is seen to extend along the 420 nm excitaUon Une as well as the 
580 nm emission line. Table HI suggests that these feamres are produced by 
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hemoglobin reabsorption. Hemoglobin reabsorpUon may also in part account for the 
shift in the peaks of the abnormal EEM relaUye tO:the normal EEM. A summary of 
the excitation and emission maxima for the peaks observed in the normal and 
abnormal sites measured is presented in Table IV. 

Fluorescence emission spectra at three ; selected excitation wavelengths are 
shown in FIG. 20, illustrating changes in relative intensiUes of fluorescence emission. 
For comparison purposes each set (normal/abnormal) was normalized to the 
maximum at 350 nm excitation. HG. 20(a) shows the emission spectra at 350 nm 
excitation. Fluorescence from the normal site is seen as a broad peak with a maximum 
at 455 nm. The peak from the abnormal site is, seen to be narrower and red-shifted. 
Examination of this spectnim at 410, 540 and 580 nm suggests that the change in 
lineshape is due to oxygenated hemoglobin. The general line shapes of the 
fluorescence observed at 410 nm excitation (FIG, 20(b)) are seen to be similar for 
both sites in the 450-575 nm excitation range, ;with a broad peak at 500 nm. The 
abnormal site shows a significantly lower fluorescence intensity, as well as an extra, 
nan-ow fluorescence peak at 640 nm. attributed to porphyrin fluorescence. HG. 20(c) 
shows the emission spectra at 460 nm excitation. The normal site shows a broad peak 
at 520 nm and clear modulation from hemoglobin reabsojrption at 540 and 580 nm. 
Fluorescence from the abnonnal site shows an even more marked hemoglobin 
reabsorption; also the overall fluorescence intensity is reduced. 

FIGS. 21A and 21B show the emission and excitaUon autocorrelation vectors 
for the same measurements. Note that the plots have , a logarithmic y-axis. The 
emission autocorrelation vectors have a large broad peak at 460 nm corresponding to 
the main fluorescence peak observed in the EEMs. The vectors show the effect of 
hemoglobin absorption around 410, 540 and 580 nm in tiie abnormal site and the 
presence of additional fluorescence in the UV in the normal sample (FIG. 21 A). This 
autocorrelation vector also highlights the peak at 610 nm in the abnonnal sample. 
The excitaUon autocorrelation vectors show different line shapes. The curve 
corresponding to tiie normal site decreases steadily from 330 nm to 500 nm excitaUon. 
The curve from the abnormal site shows a peak at 350 nm and a minimum at 410 nm. 
The latter illustrates the greater influence of hemoglobin reabsorption in the abnormal 
sample also shown in FIG. 20. 
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The corresponding reflectance data is shown in FIGS. 22A-22C. Position 1 
corresponds to the collection fibers closest to the source fiber and position 3 to those 
furthest from the source fiber as shown in HG. 15. The difference in position allow 
for spatially resolved reflectance measurements. Differences induced by the 
fluorescence reabsorption of oxygenated hemoglobin in the normal site and abnormal 
site are shown. The modulation of the spectrum, by tiie 540 and 580 absorption bands 
is seen to be significantly stronger in the abnormal saniple; tiiis is consistent with tiie 
increased reabsorption seen in the fluorescence., spectrum of the abnormal sample. 
The reflectance in tiie blue range (450-500nm) of tiie abnormal site is consistentiy 
higher than tiiat of the normal site. Below 450 nm tiie reflectance seems not to differ 
between the normal and abnormal samples. 
Conclusions 

The total data acquisition time for the data presented here was 2.5 minutes for 
a fluorescence EEM. and 1.5 minutes for the spatially resolved reflectance 
measurements. However, only 29 seconds of this time represented fluorescence 
collection. Actual reflectance collection time was 26 seconds. The most time 
consuming process was changing the excitation Wavelength using the stepper motor 
controlled excitation spectrograph and changi:ng the corresponding long-pass filter 
using the remotely controlled filter wheel. Wohn drive based monochromators are 
available (DDD180, ISA) which require less than 10 seconds to scan our entire 
wavelengtii range in 10 nm steps, and could substantially reduce the total 
measurement lime. Using a higher power lamp may furtiier reduce acquisition time of 
both fluorescence and reflectance. 

This Example has demonsti-ated the acquisition of EEMs in combination witii 
spatially resolved reflectance measurements of tissue phantoms and in tiie oral cavity 
in vivo with good signal to noise ratio. The. system features easy and arbitrary 
selection of excitation wavelengths in the UV and visible range. The system is also 
portable, and capable of fiinctioning in a hospital operating room. Probes used in tiie 
Fast EEM system incorporate channels to measure spatially resolved reflectance and 
fluorescence, and are built small enough (less :tiian about 5mm) to be used during 
endoscopic surgical procedures. Autocorrelation vectors x.v and m.v are a suitable 
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method to reduce the data set while preserving information about the wavelength 
bands carrying information. Based on the representative data shown here, fluorescence 
emission and excitation as well as reflectance data appear promising for the 
identification of tuinors of the oral cavity. The Fast EEM system is an ideal tool to 
identify a subset of the most promismg optica] features to identify pathological 
findings in large clinical smdies. 
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EXAMPLE 2 

Cervical Pre-Cancer Detection Using A Multivariate Statistical Algorithm Based On 
Laser Induced Fluorescence Spectra At Multiple Excitation Wavelengths 

A portable fluorimeter was developed and utilized to acquire fluorescence 
spectra from 381 cervical sites in 95 patients at 337, 380 and 460 nm excitation 
immediately prior to colposcopy. A multivariate statistical algorithm was used to 
extract clinically useful information from tissue spectra acquired in vivo. Two full- 
parameter algorithms were developed using tissue fluorescence emission spectra at all 
three excitation wavelengths (161 excitation-emission wavelength pairs) for cervical 
pre-cancer (squamous intraepithelial lesion (SIL)) detection: a screening algorithm 
which discriminates between SILs and non SILs with a sensitivity of 82%±1.4 and 
specificity of 68%±0.0. and a diagnostic algorithm which differentiates high grade 
SILs from non high grade SILs with a sensitivity and specificity of 79%±2 and 
78%±6, respectively. Multivariate statistical analysis was also employed to reduce the 
number of fluorescence excitaUon-emission wavelength pairs needed to re-develop 
algorithms that demonstrate a minimum decrease in classification accuracy. Two 
reduced-parameter algorithms which employ fluorescence intensities at only 15 
excitation-emission wavelength pairs were developed: the screening algorithm 
differenUates SILs from non SILs with a sensitiyity of 84%±1.5 and specificity of 
65%±2 and the diagnostic algorithm discriminates. high grade SILs from non high 
grade SILs with a sensitivity and specificity of 78,%±0.7 and 74%±2. respectively. 
Both the full-parameter and reduced-parameter screening algorithms discriminate 
between SILs and non SILs with a similar specificity (±5%) and a substantially 
improved sensitivity relative to Pap smear screening. A comparison of the full- 
parameter and reduced-parameter diagnostic algorithms to colposcopy in expert hands 
indicated that all three have a very similar sensitivity and specificity for differentiating 
high grade SILs from non high grade SILs. 

This paper presents the development and application of a detection technique 
for human cervical pre-cancer based on laser induced fluorescence spectroscopy. A 
portable fluorimeter consisting of two nitrogen pumped-dye lasers, a fiber-optic probe 
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and a polychromator coupled to an opucal muia-channel analyzer was utUized to 
acquire fluorescence spectra from 381 cervical sites in 95 patients at three excitation 
wavelengths: 337, 380 and 460 nm. A general multivariate statistical algorithm was 
then used to analyze and extract cUnically useful information from tissue spectra 
acquired in vivo. First, a screening algorithm was developed to discriminate between 
SILs and non SILs (normal squamous and columnar epitheUa and inflammation); 
second, a diagnosuc algorithm was developed to differentiate HG SILs from non HG 
SILs (LG SILs. normal epithelia and inflammation). The retrospective and prospective 
accuracy of both the screening and diagnostic algorithms were compared to the 
accuracy of Pap smear screening and to colposcopy in expert hands. 

The general multivariate statistical algorithm was initially developed and 
tested using cervical tissue spectra acquired at 337 nm excitation from 476 cervical 
sites in 92 patients. This algorithm could be used to differentiate SILs and normal 
squamous tissues with an average sensitivity and specificity of 91%+2 and 78%±3, 
respectively. A limitation however is that specti-a of normal columnar tissues and 
inflammation were indistinguishable from those of SILs at this single excitation 
wavelength. Furthemiore, a multivariate statistical algorithm based solely on spectra 
at 337 nm excitation could not discriminate, between, HG SILs and LG SELs 
effectively. 

However, multivariate statistical analysis of cervical tissue fluorescence 
spectra acquired in vivo at 380 nm and 460 nm excitation from a subset of the 92 
patients indicated that spectra at tiiese excitation wavelengtiis can overcome the 
limitations of spectra at 337 nm excitation. Spectra at 380 nm excitation from 165 
sites in a first group of 40 patients could be used to differentiate SILs from normal 
columnar epithelia and inflammation with a sensitivity and specificity of 77%± 1 and 
72%±9, respectively; spectra at 460 nm excitation from 149 sites in a second group of 
24 patients could be used to differentiate HG SILs from LG SILs with a sensitivity 
and specificity of 80%±4 and 76%±5, respectively. 

The results from previous clinical studies suggested that an algoriUim based on 
normalized, mean-scaled spectra at 337 nm excitation may be used to differentiate 
between SILs and normal squamous tissues, while an algorithm based on similarly 
pre-processed spectra at 380 nm excitation may be used to differentiate SILs from 
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normal columnar tissues and samples with inflammation. Finally, a third algorithm 
based on normalized tissue spectra at 460 nm excitation may be used to discriminate 
between LG SILs and HG SIU. These results suggest that (1) a co/7ipo5i/e screening 
algorithm based on a combination of the first two constituent aigovithms may be used 
to differentiate between SBLs and non SILs (normal epithelia and inflammation) and 
(2) a composite diagnostic algorithm which combines all \hree constituent algorithms 
may be used to differentiate HG SILs from non HG SILs (LG SILs, normal tissues and 
inflammation). 

The primary goal of the clinical study described in this Example was to 
evaluate the accuracy of constituent and composite algorithms which address certain 
limitauons of previous clinical studies. Fluorescence spectra acquired in vivo at all 
three excitation wavelengths from 381 cervical sites in 95 patients were analyzed to 
detennine if the accuracy of each of the three constituent algorithms previously 
developed may be improved using tissue spectra^at a combination of two or three 
excitation wavelengths rather than at a single excitation wavelength. A second goal of 
the analysis was to integrate the three independentjy developed constituent algorithms 
that discriminate between pairs of tissue types into composite screening and diagnostic 
algorithms that may achieve discrimination between .many of the cUnically relevant 
tissue types. The effective accuracy of a camposite , scrtemng algoritiim for the 
identification of SILs and a composite diagnostic algorithm for Uie identification of 
HG SILs was evaluated. 

The final goal of the analysis was to detennine if fluorescence intensities at a 
reduced number of excitation-emission wavelength pairs may be used to re-develop 
constituent and composite slgonthms that may achieve classification with a minimum 
decrease in predictive ability. A significant reduction in the number of required 
fluorescence excitation-emission wavelength pairs may enable tiie development of a 
cost-effective clinical fluorimeter. The accuraqy. of the constituent and composite 
algorithms based on the reduced emission variables was compared to the accuracy of 
tiiose that utilize entire fluorescence emission spectra. 
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Instrumentation 

A schematic of the portable fluorimeter which was used to acquire cervical 
tissue fluorescence spectra at three excitation wavelbngths is shown in FIG. 23(a). The 
fiber-optic probe (Valdor Fiber Optics, VSC/FER/4SMA-1/7-BUN) included a central 
fiber surrounded by a circular array of six fibers; all seven fibers having the same 
characteristics (0.22 NA, 200 ^un core diameter, 245 urn diameter with cladding). 
Three fibers along the diameter of the distal end of the probe (HG. 23(b)) were used 
for excitation light delivery. The purpose of the remaining four fibers was to collect 
the emitted fluorescence from the area directiy iiluminated by the probe. A quartz 
shield (3 mm in diameter and 2 mm thick) at the tip of the distal end of the probe that 
is in direct tissue contact (HQ. 23(c)) provided a fixed distance between the optical 
fibers and the tissue surface so fluorescence intensity can be measured in calibrated 
units. 

An area, 1 mm m diameter was illuminated by each excitation fiber. The 
overlap of the illumination area viewed by the three excitation fibers and the four 
coUection fibers was approximately 80% at the outer surface of the quartz shield. Note 
that the central excitation fiber has four adjacent collection fibers whereas the two 
excitation fibers in the periphery of the probe have only two adjacent collection fibers 
(FIG. 23(b)). However, due to the large overlap of the optical fibers at the outer face 
of the quartz shield, this difference in the excitation-emission configuration relates 
only to a small difference in the collection efficiency of the fluorescence generated 
due to excitation delivered by the central and peripheral excitation fibers. The 
difference in collecdon efficiency is accounted for, by normalizing tissue fluorescence 
spectra to the peak fluorescence intensity of a khpdamine 610 calibration standard 
measured using the same probe configuration. 

Two nitrogen pumped-dye lasers (laser characteristics: 5 ns pulse duradon. 30 
Hz repeUtion rate) (Laser Photonics. LN300C) were used to provide illumination at 
three different excitation wavelengths: one laser served to deliver excitation light at 
337 nm (fimdamental) and had a dye module which was used to generate light at 380 
nm using the fluorescent dye. BBQ (lE-03 M in 7 parts toluene and 3 parts ethanol). 
The dye module of the second laser was used to provide illumination at 460 nm. using 
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the fluorescent dye. Coumarin 460 (lE-02 M in ethanol). Laser illumination at each 
excitation wavelength. 337, 380 and 460 nm was coupled into each of the three 
excitation fibers of the probe. Note that two 10 hm bandpass filters, one centered at 
380 nm and the other centered at 460 nm were placed between the excitation fiber and 
the dye module used to generate illumination at 380 and 460 nm. respectively to 
prevent leakage from the fundamental at 337 rnn. In this Example, the average fluence 
per pulse at 337, 380 and 460 nm excitation were 15.2, 11.5 and 18 pJ/mm". 
respectively. The pulse energy at 337 nm excitation was intentionally reduced so that 
tiie measured fluorescence signal did not exceed the dynamic range of the detector. 

The proximal ends of the four coUection fibers were arranged in a circular 
array and imaged at the 500 nm wide entrance slit of a f^3.8 spectrograph equipped 
with a 300 In/mm grating (Jarrell Ash, Monospec 18) coupled to a 1,024 intensified 
diode array controlled by a multi-channel analyzer (Princeton Instruments. OMA). 
The collection optics between the proximal ehd of the four emission collection fibers 
and tiie polychromator included two quartz piano convex lenses. Between tiiese lenses 
was a filter wheel assembly containing long pass filters with 50% ti-ansmission at 360 
(GG360). 400 (GG400) and 475 (GG475) nm which are used to block scattered 
excitation light at 337, 380 and 460 nm excitation, respectively from tiie detector. The 
purpose of tiie filter wheel was to position Uie .appropriate long pass filter in the 
optical path during fluorescence measurements at each excitation wavelength. The 
niti-ogen pumped-dye lasers were used to externally trigger a pulser (Princeton 
Instruments. PG200) which served to synchroni?e the 200 ns collection gate of the 
detector to the leading edge of the laser pulse. The gating of the detector eliminated 
tiie effects of the colposcope's white light .illumination during fluorescence 
measurements. Data acquisition was computer controlled. 
Clinical measurements 

A randomly selected group of non-pregnant patients referred to the colposcopy 
clinic of the University of Texas MD Anderson Cancer Center on tiie basis of 
abnormal cervical cytology was asked to participate in tiie in vivo fluorescence 
spectroscopy study. Informed consent was obtained from each patient who 
panicipated and the study was reviewed and approved by the Listitutional Review 
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Boards of the University of Texas. Austin and the University of Texas. MD Anderson 
Cancer Center. Each patient underwent a > complete history and a physical 
examinadon including a pelvic exam, a Pap smear and colposcopy of the cervix, 
vagina and vulva. After colposcopic examihatipn of the cervix, but before tissue 
biopsy, fluorescence spectra were acquired on average from two colposcopically 
abnormal sites, two colposcopically normal squamous sites and 1 normal columnar 
site (if colposcopically visible) from each patient Tissue biopsies were obtained only 
from abnormal sites after they had been identified by colposcopy and then analyzed by 
the probe. Tissue biopsies were not obtained from normal squamous or columnar sites 
analyzed by the probe to comply with routine patient care procedure. All tissue 
biopsies were fixed in formalin and submitted for histologic examination. 
Hemotoxylin and eosin stained sections of each biopsy specimen were evaluated by a 
panel of four board certified pathologists and a consensus diagnosis was established 
using the Bethesda classification system. . This .classification system which has 
previously been used to grade cytologic specimens has now been extended to 
classification of histology samples. Samples: were classified as normal squamous, 
normal columnar, inflammation. LG SJL or HQ ^JL. Sarnples witii multiple diagnoses 
were classified into the most severe histo-patiiologic category. 

Prior to each patient study, the probe was. disii^fected and a background 
spectrum was acquired at all three excitation wavelengths consecutively witii the 
probe dipped in a non-fluorescent bottie containing distilled water. The background 
spectrum indicated no fluorescence due to optical components of tiie fluorimeter or 
die disinfectant and was subtracted from all subsequently acquired spectra at 
corresponding excitation wavelengths for that patient. Next, witil the probe placed oh 
the face of a quartz cuvette containing a solution of Rhodamine 610 dissolved in 
etiiylene glycol (2 mg/L). 50 fluorescence spectra were measured at each excitation 
wavelength. After calibration, fluorescence spectra were acquired from the cervix: 10 
spectra for 10 consecutive pulses were acquired at 337 run excitation; next. 50 spectra 
for 50 consecutive laser pulses were measured at 380 nm excitation and then at 460 
nm excitation. The data acquisition time was 0.33 s at 337 nm excitation and 1.67 s at 
each 380 and 460 nm excitation per cervical site. The time required to switch between 
die two nitrogen pumped-dye lasers and the diree long pass filters was approximately 
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5 s. Hence, the total time required to record fluorescence emission spectra at all three 
excitation wavelengths from one cervical site was approximately 10 s. Spectra were 
collected m the visible region of the electromagneUc spectrum with a resolution of 10 
nm (ftiU width at half maximum) and a signal tb noise ratio of 100:1 at the 
fluorescence maximum at each excitation wavelength. 

All spectra were corrected for the non-uniform spectral response of the 
detection system using correction factors obtained by recording the spectrum of an 
N.I.S.T traceable calibrated tungsten ribbon filament lamp: Spectra from each cervical 
site at each excitation wavelength were averaged to obtain a smgle spectrum per site. 
The fluorescence spectra obtained at each excitation wavelength from the Rhodamine 
610 calibration standard were also averaged to obtain a single spectrum per excitaUon 
wavelength. The average tissue spectra were then normalized to the average peak 
fluorescence intensity of the Rhodamine 610 calibration standard at the corresponding 
excitation wavelength for that patient; absolut^ fluorescence intensities are reported in 
these calibrated units. In this clinical study, fluorescence spectra were acquired at all 
three excitation wavelengths from each cervical; site from a total of 381 sites in 95 
patients during colposcopy. 

Development of screening and diagnostic algorithms 

FIG. 24 illustrates a schematic of the formal analytical process used to develop 
screening and diagnostic algorithms for the differential detecuon of SILs, in vivo. In 
HG. 24. the text in the dashed-line boxes * represent the mathematical steps 
implemented on the spectral data, and the text in the solid-line boxes represent the 
output after each mathematical process, There are four primary steps involved in the 
multivariate statistical analysis of tissue specu-al data (HG. 24). The first step is to 
pre-process specti-al data to reduce inter-patient and intra-patient variation within a 
tissue type; the pre-processed spectra are then dimensionally reduced into an 
informative set of principal components that describe most of the variance of tiie 
original spectral data set using Principal Component Analysis (PCA). Next, tiie 
principal components that contain diagnosticaliy relevant information are selected 
using an unpaired, one-sided student's t-test, and finally a classification algoritiim 



46 

based on logisUc discrimination is developed using these diagnosUcally relevant 
principal components. 

In summary, three constituent algorithms were developed using multivariate 
statistical andysis (Rg. 24): co«.rir„e;,, algoritto^ 

normal squamous tissues, constituent algorithm (2) discriminates between SILs and 
normal columnar tissues and finally. algorithm-O) differentiates HG SILs from LG 
SILs. The three constituent algorithms were then combined to develop two composite 
algorithms (Fig. 24): constituent algorithms (1) and (2) were combined to develop a 
composite screening algorithm which discriminates between SILs and non SILs. All 
three constituent algorithms were then combined to develop a composite diagnostic 
algorithm which differentiates HG SILs from non HG SIU. 
Multivariate statistical analysis of cervical tissue spectra 

As a first step, three methods of pre-processing were applied to the spectral 
data at each excitation wavelength: (1) normalization (i) mean-scaling and (3) a 
combination of nonnalization and mean-scaling. Similarly pre-processed spectm at 
each excitation wavelength were combined to ci^ate spectral inputs at the following 
combinations of excitation wavelengths: (337.' 46(i) nm. (337. 380) nm, (380. 460) mn 
and (337, 380. 460) nm. Pre-processing of spectral data insulted in four types of 
spectral inputs (original and three types of pre-processed spectral inputs) at three 
smgle excitation wavelengths and at four possible combinations of multiple excitation 
wavelengths. Hence, there were a total of 12 ^tral inputs at single excitation 
wavelengths and 16 spectral inputs at multipl^ excitation wavelengths which wer^ 
evaluated using the multivariate statistical algorithm. 

Prior to PCA. the input data matrix. D (r x c) was created so each row of tiie 
matrix corresponded to the pre-processed fluorescence spectrum of a sample and each 
column corresponded to the pre-processed fluorx:scence intensity at each emission 
wavelength. Spectral inputs at multiple excitation wavelengths were created by 
arranging spectra at each excitation wavelength in series in the original spectral data 
matrix. PCA was used to dimensionally reduce the pre-processed spectral data matrix 
into a smaller orthogonal set of linear combinations of the emission variables that 
account for most of tiie variance of tiie spectral data set. 
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Average values of principal component scores Were calculated for each 
principal component of each Ussue type. An unpaired, one-sided student's t-test was 
employed to determine the diagnostic content of each principal component. The 
hypothesis that the means of the principal component scores of two tissue types are 
different was tested for (1) normal squamous epithelia and SILs. (2) normal columnar 
epithelia and SJLs and (3) inflammation and SlUi. The t^est was extended a step 
further to determine if there were any statisticairy significant differences between die 
means of the principal component scores of HG SILs and LG SBLs. Principal 
components for which the hypothesis stated above was statisticaUy significant (P < 
0.05) were retained for fiirther analysis. 

Next, a statistical classification algorithm was developed using the 
diagnostically useful principal components to calculate the posterior probability that 
an unknown sample belongs to each tissue type under consideration. The posterior 
probability of an unknown sample belonging to each tissue type was calculated using 
logistic discrimination. The posterior probability is related to the prior and conditional 
joint probabilities and to the costs of misclassification of the tissue types under 
consideration. The prior probability of each tissue type was determined by calculating 
Uie observed proportion of cases in each group. Tlie cost of misclassification of a 
particular tissue type was varied from 0 to 1 in 0,1 increments, and the optimal cost 
was identified when the total number of riiisclassified samples based on the 
classification algorithm was a minimum. If tiiere .was more than one cost at which tiie 
total number of misclassified samples was a jninimum, tiie cost that maximized 
sensitivity was selected. The conditional joint probabilities were developed by 
modeUng tiie probability distiibution of each principal component of each tissue t^e 
using the normal probability density fimction, which is characterized by fi (mean) and 
a (standard deviation). The best fit of tiie nonnal probability density fimction to tiie 
probability distribution of each principal component (score) of each tissue type was 
obtained in the least squares sense, using m and a as free parameters of the fit. The 
normal probability density function was tiien used to calculate tiie conditional joint 
probability tiiat an unknown sample, given tiiat it isirom tissue type i, will exhibit a 
set of principal component scores, X. 
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The multivariate statistical algorithm was developed and optimized using a 
calibration set and tiien tested on a prediction set of approximately equal prior 
probabUity (Table 1). The puipose of testing the algoritiun on the prediction set was to 
determine (1) an unbiased estimate of the algorithm's classification accuracy and (2) if 
the number of sample spectra within each category in the calibration set is sufficient 
to describe tiie spectral data in tiie prediction set. The calibration and prediction sets 
were developed by randomly assigning tiie spectral data into tiie two sets widi the 
condition that both contain roughly equal number of samples from each histo- 
pathologic category. The random assignment ensured that not all spectra from a single 
patient were contained in the same data set. 
Development of constituent algorithms 

The multivariate statistical algoritiun was developed and optimized using all 
28 types of pre-processed specU-al inputs from the calibration set. The algorithm was 
used to identify spectral inputs which provide the/greatest discrimination between tiie 
following pairs of tissue types: (1) SELs and normal squamous epithelia, (2) SILs and 
normal columnar epitiielia, (3) SILs and inflammation, and (4) HG SILs and LG SILs. 
The optimal specti-al input for differentiating between two particular tissue types was 
identified when tiie total number of samples misblassified from the calibration set 
usmg the multivariate statistical algorithm was a minimum. The algorithm based on 
tiie spectral input tiiat minimized misclassification between the two tissue types under 
consideration was implemented on the prediction data set. 

Three multivariate statistical constituent algorithms were developed using 
tissue specti-a at tiiree excitation wavelengtiis. Constituent algoritiun (1) was 
developed to differentiate between SILs and normal squamous epitiielia; constituent 
algorithm (2) was developed to differentiate between SILs and normal columnar 
epitiielia and constituent algorithm (3) could be used to discriminate between LG SILs 
and HG SILs. A constituent algorithm which, can discriminate between SILs and 
tissues with inflammation could not be developed using specti-al data from tiie cuirent 
clinical study. 
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Development of composite algorithms 

Each of the independenUy developed constituent algorithms was intended to 
discriminate only between pairs of tissue types. A combination of constituent 
algorithms was required to provide discrimination between several of the clinicaUy 
relevant tissue types. Therefore, two composite algorithms were developed: a 
composite screening algorithm was developed to differentiate between SILs and non 
SILs (normal squamous and columnar epithelia and inflammation) using constituent 
algorithms (1) and (2) and a composite diagnostic algorithm was developed to 
differentiate HG SILs from non HG SE^ (LG SILs. normal epithelia and 
inflammation) using all three co/w/imen/ algorithms. 

The composite screening algorithm was developed in the following manner. 
First, constituent algorithms (1) and (2) were developed independenUy using the 
calibration data set. The classification outputs from both constituent algorithms were 
used to determine if a sample being evaluated is SIL or non SEL: first, using 
constituent algorithm (1). samples were classified as non SIL if they had a probability 
that is less than 0.5; otherwise, they were classified as SIL. Next, only samples that 
were classified as SIL based on the algorithm (1) were tested using algorithm (2). 
Again, samples were classified as non SIL if their posterior probability was less than 
0.5; otherwise they were classified as SDL. The spectral data from the prediction set 
was evaluated using the composite screening algorithm in an identical manner. 

The composite diagnostic algorithm was implemented in the foUowing 
manner. The three constituent algorithms were developed independenUy using Uie 
calibration set. Algorithms (1) and (2) were implemented oh each sample from Uie 
calibration data set, as described previously. Only samples that were classified as SIL 
based on algorithms (1) and (2) were tested using algorithm (3). If samples evaluated 
using algoriUim (3) had a posterior probabUity greater than 0.5, they were classified as 
HG SIL; oUierwise Uiey were classified as non HG SIL. The spectral data from Uie 
prediction set was evaluated using die compo5/7e, diagnostic algoriUim in an identical 
manner. 
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Results 

Constituent algorithms (1), (2) and (3) 

Table 2 summarizes the components of the optimal set of three constituent 
algont\m\s. Constituent algorithm (1) can be used to differentiate between SILs and 
normal squamous epithelia; algorithm (2) differentiates between SILs and normal 
columnar epithelia and algorithm (3) discriminates between LG SILs and HG SILs. 
Pre-processing 

HG. 25(a) illustrates average fluorescence specUa per site acquired from 
cervical sites at 337 nm excitation from a typical, patient. All fluorescence intensities 
are reported in the same set of calibrated units. Corresponding normalized and 
normalized, mean-scaled spectra are illustrated in HG. 25(b) and 25(c). respectively. 
Evaluation of the original spectra at 337 nm excitation (Fig. 25(a)) indicates that the 
fluorescence intensity of SILs is less than that of the corresponding normal squamous 
tissue and greater than that of the corresponding normal columnar Ussue over the 
entire emission spectrum. Examination of normaUzed spectra from this patient (Fig. 
25(b)) indicates that following normalization, tiie fluorescence intensity of the normal 
squamous tissue is greater than that of corresponding SILs, oyer die wavelength range 
360 to 450 nm only; between 460 and 600 nm. the fluorescence intensity of SILs is 
greater than that of the corresponding normal squamous tissue which in part reflects 
the longer peak emission wavelength of SILs. A comparison of the spectral line shape 
of SILs to that of the normal columnar tissue illustrates the.ppposite phenomenon. The 
normalized fluorescence intensity of SILs is greater than that of tiie corresponding 
normal columnar tissue over the wavelength range 360 to 450 nm; however, between 
460 and 600 nm, the fluorescence iiitensity of the norinal columnar tissue is greater 
than that of the SILs; this spectral difference reflects the longer peak emission 
wavelength of the normal columnar tissue relative to that of SILs. Further evaluation 
of nonnalized spectra in Fig. 25(b) indicates that there are spectral line shape 
differences between LG SILs and HG SILs over Uie wavelength range 360 to 420 nm. 

The coiresponding normalized, mean-scaled spectra of this patient, shown in 
Fig. 25(c) displays differences in the normalized fluorescence spectrum (Fig. 25(b)) 
from a particular site with respect to the average normalized spectrum (the average of 
all normalized spectra obtained from this patient). As the average normalized 
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spectrum has been subtracted from each normalized spectrum obtained from this 
patient, the mean now lies at Y=0 over the entire emission wavelength range. 
Evaluation of Fig. 25(c) indicates that between 360 and 450 nm, the normalized, 
mean-scaled fluorescence intensity of the normal squamous tissue is greater than the 
mean, and that of the normal columnar tissue is less than the mean. Above 460 mn, 
the opposite phenomenon is observed; the fluorescence intensity of the normal 
squamous tissue is less than the mean, while that of the normal columnar tissue is 
greater than the mean. The fluorescence intensity of SlLs lies close to the mean and is 
bounded by the intensities of the two normal tissue types. In addition, between 360 
and 420 nm, the normalized, mean-scaled fluorescence intensity of the LG STL is 
slightly greater than the mean, while that of the HG SIL is less than the mean. 

HG. 26(a) illustrates average fluorescence spectra per site acquired from 
cervical sites at 380 nm excitation, from the same patient. HG. 26(b-c) show the 
corresponding normalized, and normalized, meanrscaled spectra, respectively. In Fig. 
26(a), the fluorescence intensi ty of SIU is less than that of the corresponding normal 
squamous tissue, with the LG SIL exhibiting the weakest fluorescence intensity over 
the entire emission spectrum. Note that the fluorescence intensity of the normal 
columnar sample is indistinguishable from that of the HG SIL. Nomialized spectra at 
380 nm excitation, (26(b)), indicate that over the wavelength range 400 to 450 nm, the 
fluorescence intensity of the normal squamous: Ussue is slighUy greater than that of 
SILs and that of the normal columnar tissue is Jess than that of SJLs. The opposite 
phenomenon is observed above 580 nm. A care&l examination of the spectra of the 
LG SIL and HG SIL indicates that between , 460 and 580 nm, the normalized 
fluorescence intensity of the LG SIL is higher than that of the HG SIL. The ; 
nonnalized, mean-scaled spectra (Fig. 26(c)) enhances the previously observed 
normalized spectral line shape differences by displaying them relative to the average 
normalized spectrum of this patient. Fig. 26(c) indicates that between 400 to 450 nm. 
the fluorescence intensity of the normal squamous tissue is greater than the mean and 
that of the normal columnar tissue is less than the mean. The opposite phenomenon is 
observed above 460 nm. The fluorescence intensity of the SILs is bounded by the 
intensities of the two normal tissue types over the entire emission spectrum. The LG 
SIL and HG SIL also show spectral line shape differences; above 460 nm, the 
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nonnalized, mean-scaled fluorescence intensity of the LG SIL Ues above the mean and 
that of the HG SIL lies below the mean. 

HG. 27(a-c) illustrate original, normalized and nonnalized, mean-scaled 
spectra, respecuvely at 460 nm excitation from the same patient. Evaluation of Fig. 
27(a) indicates that the fluorescence intensity of SILs is less than that of the 
corresponding normal squamous tissue and greater than diat of the corresponding 
normal columnar sample over the entire emission spectrum. Evaluation of normalized . 
spectra at Uiis excitation wavelengtii (Fig. 27(b)) demonstrates tiiat below 510 nm, the 
fluorescence intensity of SILs is less than tiiat of the normal squamous tissue and 
greater Uian that of tiie corresponding normal columnar tissue. Above, 580 nm, tiie 
normalized fluorescence intensity of SILs is less Uian that of tiie normal columnar 
tissue and greater tiien that of normal squamous tissue. Note that tiiere are spectral 
line shape differences between the LG SIL and HG SIL between 580 and 660 nm; tiie 
normalized fluorescence intensity of tiie LG SDL is greater tiian tiiat of tiie HG SIL. 
The normalized, mean-scaled spectra shown in Fig. 27(c) reflects the differences 
observed in tiie normalized spectra relative to the average normaUzed spectrum of tiiis 
patient. Below 510 nm, tiie fluorescence intensity of the normal squamous tissue is 
greater than tiie mean, while tiiat of the normal qolumnar tissue is less than tiie mean. 
Above 580 nm, the opposite phenomenon is observed. The fluorescence intensity of 
the SILs lies between tiiose of the two normal tissue types. Above 580 nm, tiie 
fluorescence intensity of the LG SIL is greater than the mean and tiiat of tiie HG SE. is 
less than the mean. : ; 

Principal Component Analysis and Logistic Discrim^^ 

Constituent algorithm (1) which differentiates SILs from normal squamous tissues 

A constituent algoritiim based on normalized spectra arranged in series at all 
tiiree excitation wavelengtiis provided tiie greatest discrimination between SILs and 
nomial squamous tissues. The algorithm demonstrated an incremental improvement in 
sensitivity witiiout sacrificing specificity relative to tiie previously developed 
constituent algoritiim (1) tiiat employed normalized, mean-scaled spectra at 337 nm 
excitation only. Multivariate statistical analysis of normalized tissue spectra at aU 
Uiree excitation wavelengths, indicated tiiree principal components show statistically 
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significant differences between SILs and normal; squamous tissues (Table 2). These 
three principal components account collectively ifor 65% of the total variance of the 
spectral data set. Logistic discrimination was used to develop a classification 
algorithm to discriminate between SILs and normal squamous epithelia based on these 
three informative principal components. Prior probabilities were determined by 
calculating the percentage of each tissue type from the data set: 62% normal 
squamous tissues and 38% SILs. The cost of misclassification of SIL was optimized at 
0.7. Posterior probabilities of belonging to each tissue type were calculated for all 
samples from the data set. using the known prior probabiUties, cost of 
misclassification of SILs and the conditional joint probabilities calculated from the 
normal probability density fiinction. HG. 28 illustrates the retrospective accuracy of 
the algorithm applied to the calibraUon data set. The posterior probability of being 
classified into the SIL category is plotted for all SILs and normal squamous epithelia. 
HG. 28indicates that 92% of HG SILs and 83% of LG SDLs are correcUy classified 
with a posterior probabUity greater than 0.5. Approximately 70% of colposcopically 
normal squamous epithelia are correcUy classified with a posterior probability less 
than 0.5. 

The confusion matrix in Table 3 compares the retrospective accuracy of the 
algorithm on the calibration data set to its prospeiptive accuracy on the prediction set! 
In the confusion matrix, the first row corresponds to the histo-pathologic classification 
and the first column corresponds to the spectroscopic classificaUon of the samples. A 
prospective evaluation of the algorithm's accuracy indicates that there is a small 
increase in the proportion of correctly classified LG SILs and no change in the 
proportion of correctly classified HG SILs or normal squamous tissues. Note that the 
majority of normal columnar tissues and samples with inflammation from both 
calibration and predicUon sets axe misclassified as SIL using this algorithm. 
Evaluation of the misclassified SILs from the calibration set indicates that one sample 
(out of 19) with CIN ffl. two samples (out of 16) with GIN H. two samples (out of 16) 
with GIN I and two samples (out of 7) with HPY: are incorrecUy classified. From the 
prediction set, two samples (out of 19) with CIN m, one samples (out of 16) with CIN 
n, two samples (out of 16) with CIN I and one sample (out of 8) with HPV are 
incorrectly classified as non SIL. 
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Constituent algorithm (2) which differentiates SILsfrom normal columnar tissues 

The greatest discrimination between SBLs and normal columnar epithelia was 
achieved using a constituent algorithm based on normalized, mean-scaled spectra at 
all three excitation wavelengths. This algorithm demonstrated a substantially 
improved sensitivity for a similar specificity relative to the previously developed 
constituent algoritimi (2) which used normaUzed, mean-scaled spectra at 380 nm 
excitation, only. Multivariate statistical analysis of a combination of normalized, 
mean-scaled tissue spectra at all three excitation wavelengtiis resulted in four 
principal components that demonstrate statistically significant differences between 
SlLs and normal columnar epitiieha (Table 2): These four principal components 
collectively account for 80% of the total variance of the spectral data set. Logistic 
discrimination was employed to develop a classification algorithm to discriminate 
between SILs and normal columnar epithelia. The prior probabilities were determined 
to be: 28% normal columnar tissues and .72% SD^. The optimized cost of 
misclassification of SIL was equal to 0.58. Posterior probabilities of belonging to each 
tissue type were calculated for all samples from the data set. HG. 29 illustrates the 
retrospective accuracy of tiie algoritiim applied to the , calibration data set. The 
posterior probability of being classified into .Uie SIL category is plotted for all SBLs 
and nomial colunmar samples examined. FIG. 29 graphically indicates Uiat 91% of 
HG SILs and 83% of LG SILs have a posterior probability Uiat is greater tiian 0.5. 
Seventy-six percent of colposcopically normal ..cplumnar. epithelia are correcUy 
classified witii a posterior probability less tiian 0,5.: 

The confusion matrix in Table 4 compares tiie retrospective accuracy of tiie 
constituent algorithm on the calibration data set to its prospective accuracy on tiie 
prediction set. The prospective accuracy of the algoritiim aable 4) indicates that there 
is a small increase in the proportion of correctiy classified LG SILs and a small 
decrease in the proportion of correctly classified HG SILs; tiiere is approximately a 
10% decrease in the proportion of correctly classified normal columnar tissues. Note 
that the majority of normal squamous tissues and samples witii inflammation from 
botii the calibration and prediction sets are misclassified as SIL using tiiis algoritiim. 
Evaluation of the misclassified SILs from the calibration set indicates tiiat three 
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samples (out of 16) with CIN D, three samples (out of 16) with CIN I and one sample 
(out of 7) with HPV are incorrectly classified From the prediction set, two samples 
(out of 19) with CIN m. three samples (out of 16) with CIN H, and three samples (out 
of 16) with GIN I are incorrectly classified. 

Constituent algorithm (3) which differentiates HGSILs and LG SILs 

A combination of normalized spectra at all three excitation wavelengths 
significantly enhanced the accuracy of the previously developed constituent algorithm 
(3) which differentiated HG SILs from LG SILs using normalized spectra at 460 nm 
excitation. Multivariate statistical analysis of normalized specti-a at all three excitation 
wavelengths resulted in four statistically significant principal components, that 
account collectively for 67% of the total variance of the specti^ data set (Table 2). 
Again, a probabUity based classification algorithm was developed to differentiate HG 
SILs from LG SILs. The prior probability was: 40% LG SILs and 60% HG SILs. The 
optimal cost of misclassification of HG SIL was equal to 0.51. Posterior probabilities 
of belonging to each tissue type were calculated. HG. 30 illustrates the retrospective 
accuracy of the algorithm applied to the calibration data set. The posterior probabUity 
of being classified into tiie HG SBL category is plotted for all SILs evaluated. Fig. 30 
indicates tiiat 83% of HG SILs have a posterior, probability greater than 0.5, and 70% 
of LG SILs have a posterior probability less tiian 0.5. 

The confusion matrix in Table 5 compares the retrospective accuracy of the 
constituent algorithm on the calibration set -to its prospective accuracy on the 
prediction set. Its prospective accuracy indicates tiiat tiiere. is a 5% decrease in tiie 
proportion of correctiy classified LG SILs and no change iu: the proportion of correcUy 
classified HG SIU. From die calibration set. six HG SILs are misclassified; three 
samples (out 19) with CIN m and three samples (out of 16) with CIN H are 
misclassified as LG SIL. The misclassified LG SILs comprise of five samples (out of 
16) with CIN I and two samples (out of 7) with HPV. From the prediction set. five HG 
SILs are misclassified; two samples (out of 19) with CIN IH and three (out of 16) with 
CIN n. There were ten misclassified LG SILs from tiie prediction set: seven with CIN 
I (out of 16) and three (out of 8) with HPV. 
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"Full-parameter" composite screening and diagnostic algorithms 

A composite screening algorithm was developed to differentiate SILs and non 
SILs (normal squamous and columnar epithelia and inflammation) and a. composite 
diagnostic algorithm was developed to differentiate HG SILs from non HG SILs G^G 
5 SILs, normal epithelia and inflammation). The effective accuracy of both composite 
algorithms were compared to those of the constituent algorithms from which they 
were developed and to the accuracy of current detection modalities. 
A composite screening algorithm which discriminates between SILs and non SILs 

A composite screening algorithm to differentiate SILs from non SILs was 

10 developed using a combination of the two constituent algorithms: algorithm ( 1 ) which 
differentiates SILs from normal squamous tissues and algorithm (2) which 
differentiates SELs from normal columnar epithelia. The optimal cost of 
miclassification of SIL was equal to 0.66 for constituent algorithm (1) and 0.64 for 
constituent algorithm (2). Only the costs of misclassification of SIL of the two 

15 constituent algorithms was altered for the development of the composite screening 
algorithm. These costs were selected to minimize the total number of misclassified 
samples. 

The accuracy of the composite screening algorithih on the calibration and 
prediction data sets is illustrated in the confusion matrix in Table 6. Examination of 

20 the confusion matrix indicates that the algorithin correctly classifies approximately 
90% of HG SILs and 75% of LG SIL from th6 calibration data set. Furthermore, 
approximately, 80% of normal squamous tissues and 70% of normal columnar 
epithelia froni the cahbration set are correcUy cl^sified. Evaluation of the predicUbn 
set indicates that there is a small change in the proportion of correctly classified HG 

25 SILs and LG SILs. There is a negligible change iu: the correct classification of normal 
squamous and colunuiar tissues. Note that while, 80% of samples with inflammation 
from the calibration set are incorrectly classified, as SEL, only 43% of these samples 
from the prediction set are incorrectly classified. : . . 

A comparison of the accuracy of the composite screening algorithm (Table 6) 

30 to that of each of the constituent algorithms (1) (Table 3) and (2) (Table 4) on the 
same spectral data set indicates that in general, there is less than a 10% decrease in the 
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proportion of correctly classified SILs using the composite screening algorithm 
relative to using either of the constituent algonthms independently. Note however that 
the proportion of coirecUy classified normal (squamous and columnar) epithelia is 
substantially higher using the composite algorithm relative to using either of the 
constituent algorithms independently. These results confirm that utilization of a 
combination of the two constituent algorithms, significantly reduces the false-positive 
rate relative to that using each algorithm independently. Evaluation of the 
spectroscopically misclassified SILs from the calibration set aable 6) indicates that 
only one sample (out of 19) with CIN m. three samples (out of 16) with CIN H, two 
samples (out of 16) with CIN I and four samples (out of 7) with HPV are incorrecUy 
classified. From the prediction data set (Table 6), two samples (out of 19) with CIN 
m. four samples (out of 1 6) with CIN D, three saitiples (out of 1 6) with CIN I and one 
sample (out of 8) with HPV are incorrecUy classified. 

A composite diagnostic algorithm which differ^ti^es HG SIU from non HG SILs 

A composite diagnostic algorithm which differentially detects HG SILs was 
developed using a combination of all three constituent algorithms: algorithm (1) 
which differentiates SILs from normal squainious tissues, algorithm (2) which 
differentiates STU from normal columnar epithelia and algorithm (3) which 
differentiates HG SILs from LG SILs. The optimal costs of miclassification of SIL 
was equal to 0.87 for algorithm (1) and 0.65 for algorithm (2); the optimal cost of 
misclassification of HG SE. was equal to 0.49 for algorithm (3). Only the costs of 
misclassification of SIL of constituent algorithms (1) and (2) and the cost of 
misclassification of HG SIL of conj/i/u^^^ were altered during , 

development of the composite diagnostic algorithm. These costs were selected to 
minimize the total number of misclassified samples. 

The results of the composite diagnostic algorithm on the calibration and 
prediction sets are shown in. the confusion matrix in Table 7. The algorithm correctly 
classifies 80% of HG SIU. 74% of LG SILs and more than 80% of normal epitheUa. 
Evaluation of tiie prediction set using this composite aigoriUim indicates that tiiere is 
only a 3% decrease in Uie proportion of correctiy classified HG SILs and a 7% 
decrease in the proportion of correcUy classified LG SILs. There is less than a 10% 
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decrease in the proportion of correctly classified normal epithelia. A comparison 
between the calibration and predicUon sets indicates that while more than 70% of 
samples with inflammation from the calibration data set are incorrectly classified as 
HG SIL, only 14% of samples with inflammation from the prediction set are 
incorrectly identified. Due to the relatively small number of samples examined in this 
histo-pathologic category, the results presented here do not conclusively establish if 
the algorithm is capable of correcUy identifying inflammation. 

A comparison of the accuracy of the composite diagnostic algorithm to that of 
constituent algorithm (3) which differentiates HG SILs from LG SILs (Table 5) 
indicates there is less than a 5% decrease in the proportion of correctly classified HG 
SILs and a 5% increase In the proportion of cbri-ectly classified LG SILs using the 
composite diagnostic algorithm relative to using the constituent algorithm (3). 
Evaluation of the HG SJU from the calibration set (Table 7) that were incorrecUy 
classified indicates that three samples (out of 19).vyith.qiN Ifl. and four samples (out 
of 1 6) with CIN n are incorrectly classified. From the prediction set. four samples (out 
of 19) with CIN m and five samples (out of 16) with CIN H are incorrecUy classified. 
"Reduced-parameter" composite screening and diagnostic algorithms 
Component Loadings: A component loading represents the correlation between each 
principal component and the original pre-processed fluorescence emission spectra at a 
particular excitation wavelength. HG. 3 l(a-c) illustrate component loadings of the 
diagnostically relevant principal components oS constituent algorithm (1) obtained 
from noraialized spectra at 337, 380 and 460 nm excitation. respecUvely. HG. 32(a-c) 
display component loadings that correspond to ithe diagnosUcally relevant principal 
components of constituent algorithm (2) obtained from normalized, mean-scaled 
spectra at 337, 380 and 460 nm excitation, respectively. Finally. HG. 33(a-c) display 
the component loadings corresponding to the diagnosUcally relevant principal 
components of constituent algorithm (3). obtained from normalized spectra at 337, 
380 and 460 nm excitation, respectively. ]n each graph shown, the abscissa 
corresponds to the emission wavelength range at a particular excitation wavelength 
and the ordinate corresponds to the correlation coefficient of the component loading. 
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CorrelaUon coefficients of the component loading above 0.5 and below -0.5 are 
considered to be significant. 

FIGS. 31(a), 32(a) and 33(a) display component loadings of principal 
components of constituent algorithms (1), (2) and (3), respectively, obtained from pre- 
processed spectra at 337 nm excitation. A closer examination indicates that 
component loading 1 is nearly identical for all three algorithms. Evaluation of this 
loading indicates that it is positively correlated with corresponding emission spectra 
over the wavelength range 360-440 nm and negatively correlated with corresponding 
emission spectra over the wavelength range ; 460-660 nm. All remaining principal 
components of all three algorithms display a correlation between -0.5 and 0.5, except 
component loading 4 of algorithm (2) (Fig. . 32(a)) which displays a positive 
correlation of 0.75 with the corresponding emission spectra at 460 nm. 

FIGS. 31(b), 32(b) and 33(b) display component loadings that correspond to 
the diagnostically relevant principal components of co/wftWn/ algorithms (1), (2) and 
(3), respectively obtained from pre-processed spectra at 380 nm excitation. 
Component loading 1 of all three algorithms is positively correlated with 
corresponding emission spectra over the wavelength range, 400-450 nm. Between 
500-600 nm, component loading 1 of algorithm (2) (Fig. 32(b)) is correlated 
negatively with corresponding emission spectra.^ E;tamination of component loading 3 
of algorithm (1) (Fig. 31(b)) and algorithm (3) (Fig. 33(b)) indicates that they are also 
negatively correlated with corresponding emission spectra from 500-600 nm. Only 
component loading 2 of algorithm (2) (Fig.,;32(b)) is positively coirelated with 
corresponding emission spectra from 500-600. niij.; Also note that component loading 
3 of algorithm (1) (Fig. 31(b)) and component loadings 3 and 6 of algorithm (3) (Fig. 
33(b)) display a correlation with corresponding emission spectra at approximately 640 
nm. 

HGS. 31(c), 32(c) and 33(c) display component loadings that correspond to 
the diagnostic principal components of constituent algorithms (1), (2) and (3), 
respectively obtained fi-om pre-processed spectra at 460 nm excitation. Note that only 
component loading I displays a negative correlation (< -0.5) with corresponding 
emission specti-a for all three algorithms. This component loading is correlated with 
corresponding emission spectra over the wavelength range 580-660 nm. The 
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remaining principal components of all three algorithms display a correlation between - 
0.5 and 0.5. 

The component loadings at aU three excitation wavelengths of all three 
constituent aigohthms were evaluated to select fluorescence intensities at a minimum 
number of excitation-emission wavelength pairs required for the previously developed 
constituent and composite algorithms to perform with a minimal decrease in 
classification accuracy. Portions of the component loadings of the three constituent 
algorithms most highly correlated (correlation > 0.5 . or < -0.5) with corresponding 
emission spectra at each excitation wavelength were selected and the reduced data 
matrix was then used to regenerate and evaluate the constituent and composite 
algorithms. It was iteratively determined that fluorescence intensities at a minimum of 
15 excitation-emission wavelength pairs are required to re-develop constituent and 
composite algorithms that demonstrate a minimum decrease in classification accuracy. 
At 337 nm excitation, fluorescence intensities at two emission wavelengths between 
360-450 nm and intensities at two emission wavelengths between 460-660 nm were 
selected. At 380 nm excitation, intensities at two.emission wavelengths between 400- 
450 nm and intensities at four emission wavelengths between 500-640 nm were 
selected. Finally, at 460 nm excitation, fluorescence, intensities at five emission 
wavelengths over the range 580-660 nm was selected. Table 8 lists these excitation- 
emission wavelengtii pairs for each of the \hiec constituent algorithms, (1), (2) and 
(3). These excitation-emission wavelength pairs. are also indicated on the component 
loading plots in Figs. 31-33. The bandwidth at each emission wavelengtii is 10 nm. 
Reduced-parameter composite algorithms 

Using tiie fluorescence intensities only at tiie selected excitation-emission 
wavelengtii pairs, die three constituent algorithms were re-developed using the same 
formal analytical process as was done previously using die entire fluorescence 
emission specti^ at all three excitation wavelengths (Fig. 24). The Uiree constituent 
algorithms were then independently optimized using the calibration set and tested 
prospectively on tiie prediction data set. They were combined as described previously 
into composite screening and diagnostic algorithms. The effective accuracy of tiiese 
reduced-parameter composite algoritiims were compared to that of the fuU-parameter 
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composite algorithms developed previously using fluorescence emission spectra at all 
three excitation wavelengths. 

Table 9 displays the accuracy of the reduced-parameter composite screening 
algorithm (based on fluorescence intensities at 15 excitation-emission wavelength 
pairs) which discriminates between SILs and non SILs applied to the calibration and 
prediction sets. A comparison between the calibration and prediction data sets 
indicates that there is less than a 10% decrease in the proportion of conecUy classified 
SRjs and normal squamous tissues from the prediction set. Note however that there is 
a 20% increase in the proponion of correctly classifled normal columnar epithelia and 
a 40% increase in the proportion of correctiy classified samples with inflammation 
from the prediction set. 

The accuracy of the reduced-parameter composite screening algorithm (Table 
9) was compared to that of the full-parameter composite screening algorithm (Table 
6) applied to the same spectral data set. A comparison indicates that in general tiiere is 
less than a 10% decrease in the accuracy of the . reduced-parameter composite 
algorithm relative to that of the full-parameter composite screening algorithm, except 
for a 20% decrease in the proponion of correcUy classified normal columnar epithelia 
from the calibration set tested using the reduced-parameter composite screening 
algorithm (Table 9). 

Table 10 displays the accuracy of the reduced-parameter composite diagnostic 
algorithm that differentially identifies HG SILs from the calibration and prediction 
sets. A comparison of sample classification between the calibration and prediction 
data sets indicates that there is negligible change in the proportion of correctly 
classified HG SILs. iG SILs and normal squamous epithelia. Note that there is 
approximately a 20% increase in the proportion of correctiy classified normal 
columnar epithelia and samples with inflammation from the prediction set. 

A comparison of the composite diagnostic algorithm based on the reduced 
emission variables (Table 10) to that using fluorescence emission spectra at all three 
excitation wavelengths (Table 7) applied to the same spectral data set indicates that in 
general, the accuracy of the reduced-parameter com/7C7^//e diagnostic algoritiim is 
within 10% of that reported for tiie m-psiamctcT composite diagnostic algoritiim; 
however, a comparison between Tables 7 and 10 indicates that tiiere is approximately 
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a 15% decrease and a 20% increase in the proportion of correctly classified normal 
columnar epithelia from the calibration and prediction sets (Table 10). respectively 
which were tested using the reduced-parameter compoji/e diagnostic algorithm. The 
opposite trend is observed for samples with inflammation tested using the reduced- 
parameter composite diagnostic algorithm (Table 10). 

Table 11 compares the sensitivity and specificity of the full-parameter and 
reduced-parameter composite algorithms to that of Pap smear screening and 
colposcopy in expert hands. Table 11 indicates that the composite screening 
algorithms have a similar specificity and a si^ficantly improved sensitivity relative 
to Pap smear screening. A comparison of the sensitivity of the composite screening 
algorithms to that of colposcopy in expert hands for differentiating SILs from non 
SJLs indicates that these algorithms deraonsttate a 10% decrease in sensitivity, but a 
20% improvement in specificity. The composite diagnostic algorithms and colposcopy 
in expert hands discriminate HG SILs from , npn^ HG SILs with a very similar 
sensitivity and specificity. Also note that the variability (standard deviation) of both 
Pap smear screening and colposcopy in expert hands is substantially higher than that 
of the full-parameter and reduced-parameter spreeping and diagnostic algorithms. A 
comparison between the fiiU.parameter and reduced-parameter composite algorithms 
indicates that the algorithms based on the reduced emission variables demonstrate a 
minimal decrease in classification accuracy relative to those that employ fluorescence 
emission spectra at all three excitation wavelength?. 
Discussion and Conclusions ■ .•• ■/ ^ 

Cervical tissue fluorescence spectra retorded at 337, 380 and 460 nm 
excitation can be used to develop composite screening and diagnostic algorithms for 
the differential detection of SILs in vivo. The composite screening algorithm 
discriminates between SILs and non SILs with a similar specificity and a substantially 
improved sensiUvity relaUve to standard Pap smear screening. When compared to 
colposcopy in expert hands, the composite screening algorithm displays a 10% 
decrease in sensitivity but almost a 20% improvement in specificity. A comparison 
between the composite diagnostic algorithm and colposcopy in the hands of expert 
practitioners indicates that both have a very similar sensitivity and specificity for 
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discriminating between HG SILs and non HG SELs. Note that as spectroscopic 
interrogation of diseased and non-diseased cervical tissue sites in the current clinical 
study was directed by colposcopic impression, the sensitivity of the spectroscopic 
algorithms could not exceed the sensitivity of colposcopy. Ih other words, if there 
5 were histologically diseased cervical tissue sites that were overlooked by colposcopy, 
these false-negatives were not be evaluated spectroscopically. As a result, the 
potential of fluorescence spectroscopy to comecUy classify these false-negatives could 
not be determined. 

The full-parameter composite algorithms were re-developed using 

10 fluorescence intensities at 15 excitation-emission wavelength pairs, to generate 
reduced-parameter composite algorithms. The fluorescence intensities at these reduced 
number of excitation-emission wavelengtii pairs were selected using a parameter 
called the component loading calculated from the principal components. Evaluation of 
the reduced-parameter composite algorithms indicates tiiat they display a minimal 

15 decrease in sensitivity and specificity relative, to the fuU-parameter composite 
algorithms. The reduction in the number of excitation-emission wavelengtii pairs from 
161 to 15 unplies reduction in tiie complexity and cost of tiie portable fluorimeter 
which would be used to measure cervical tissup .fluorescence. For example, if 
fluorescence intensities at only 15 excitation^eniissipn wavelength paks need to be 

20 measured, tiie polychromator and intensified .diode . array can be replaced by a 
mechanical filter assembly and a single channel detector. This represents a substantial 
decrease in cost and complexity of this instrumentation at the expense of less tiian a 
1% decrease in sensitivity. 

Several significant improvements arid refinements have been made in 

25 previously developed constituent algorithms using tissue spectra at aU tiiree excitation 
wavelengths. Previously, the constituent algorithm (1) which differentiates SILs from 
normal squamous epithelia was developed using normalized, mean-scaled spectra at a 
single excitation wavelength: 337 nm. Spectra at this excitation wavelengtii had to be 
mean-scaled in order to calibrate for the significant inter-patient variation in spectral 

30 line shape. This algorithm demonstrates the greatest classification accuracy when the 
patient being evaluated has equal numbers of diseased and non-diseased tissue sites. 
This restriction clearly reduces the clinical effectiveness of tiiis algoritiim. The new 
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algorithm which is based on normalized emission spectra at all three excitation 
wavelengths, minimizes this inter-patient variation and hence obviates the need for 
mean-scaling, while maintaining a slightly improved classification accuracy. Inclusion 
of spectra at addiUonal excitation wavelengths represents a significant improvement in 
the clinical effectiveness of this algorithm as it can be applied to a much wider 
population of patients. 

The accuracy of previously dtv&lopcd constituent algorithm (2) which 
discriminates between SILs and normal columnar epithelia was significantly improved 
by using nonnalized, mean-scaled spectra at all three excitation wavelengths rather 
than at a single excitation wavelength. Despite the significant improvement in these 
results, this algorithm is also based on tissue spectra that require mean-scaling at each 
excitation wavelength. A multivariate statistical algorithm based on normalized 
spectra only, at all three excitation wavelengths differentiates STLs from normal 
columnar epithelia with a significantly poorer .seixsitivity than the algorithm that uses 
normalized, mean-scaled spectra at all three excitation wavelengths. Therefore, mean- 
scaling is essenUal for the opUmal operation of this algorithm. 

Fmally, an improvement that is significant is the development of the third 
constituent algorithm which discriminates between LG SDLs and HG SILs using tissue 
spectra at all three excitation wavelengths. The utilization of spectra at all three 
excitation wavelengtiis results in a substantia} improvement in sensitivity relative to 
using the constituent algorithm (3) which is based pn a single excitation wavelengtii. 
Furthermore, spectra required for tiiis algorithm do .not have to be mean-scaled for 
inter-patient variation in spcictral line shape. 

Each of tiie three co/wftrue/ir algorithms developed using specti^ data from 
the current cUnical study discriminate between a specific pair of tissue types. Using 
each constituent algoritiun, a posterior probability assignment of an unknown sample 
to a particular tissue category is calculated using a set of diagnosUcally relevant 
principal components Uiat demonsu-ate statistically significant differences between tiie 
two tissue types under consideration. The posterior probability output of the 
constituent algoritiuns are then combined to ..develop composite screening and 
diagnostic algoritiams tiiat discriminate between many of the clinically relevant tissues 
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types. Hence, development of the two composite algorithms is based on the prior 
development of the three constituent algorithms; 

To test the feasibility of an alternate approach, the two compoji/g algorithms 
were developed directly from diagnostically relevant principal components of their 
corresponding constituent algorithms, thereby by-passing the consHtuent algorithm 
development phase. The composite screening algorithm which discriminates between 
SILs and non SILs was developed using logistic discrimination based on the 
diagnostically relevant principal components of co/i^rfme/if algorithms (I) and (2); the 
posterior probability of an unknown sample being classified as either SIL or non SIL 
was calculated. The composite diagnostic algorithm which discriminates between HG 
SILs and non HG SILs was developed using logistic discrimination based on the 
diagnostically relevant principal components of constituent algorithms (1). (2) and (3); 
the posterior probability of an unknown sample being classified as either HG SEL or 
non HG SIL was calculated. The composite algorithms developed directly firom the 
diagnostically relevant principal components of their corresponding co«j//toe«/ 
algorithms demonstrated a poorer classification, accuracy relative Xo composite 
algorithms that were developed using a combination of corresponding constituent 
algorithms. Therefore, compoji/e screening and diagnostic algorithms were developed 
using a combination of independently iit\t\o^^ constituent algorithms. 

Pre-processing to remove inter-patient apd intra-patient variation prior to the 
development of the multivariate statistical algorithm may remove the spectral 
variations that may be significant from a biological standpoint. However, in the 
development of multivariate statistical screening and diagnostic algorithms that can 
successfully identify disease in any given patient, tiie mtia-patient and mter-patient 
spectral variations must be removed if they do obscure the important inter-category 
differences that tiie algorithm needs to extract. If a sophisticated physical model can 
be developed to describe the biological basis of the spectral data as well as the inter- 
patient and intra-patient spectral variations accurately, then this information can be 
used to develop better methods of pre-processing or direct the need for additional 
measurements to calibrate for these variations. This is an important issue to address 
and is currently the subject of study in our laboratory. 
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In spite of the successful development of algorithms that can differentiate (1) 
SILs from normal tissues and (2) HG SILs from non HG SILs and normal epitheUa, 
these algorithms do not consistently classify samples with inflammation as non SIL; 
this results in a decrease in their specificity. Although the number of samples 
examined in this histo-pathologic category is limited, analysis from previous and 
current clinical studies indicates that it relatively difficult to correctly classify these 
samples. A plausible explanation for this is that (1) the current excitation wavelengths 
used may not be optimum for identification of fluorophores that are unique to 
inflammation and/or (2) the penetration depth of the light may not be sufficienUy long 
to spectroscopically interrogate tiie underlying stromal layers where inflammation 
develops. 

The specificity of fluorescence spectroscopy for the detection of cervical 
neoplasia may be improved by using fluorescent photosensitizers to enhance the 
contrast between neoplastic and non-neoplaslic tissues m v/vo. The use of 
photosensitizers such as photofrin. hematoporphyrin derivative or 5- ALA may 
potentially enhance the spectiroscopic differences between neoplastic and non- 
neoplastic (normal and inflammatory) cervical tissues and hence contribute to an 
improved specificity of tiie spectroscopic algorithms. 

Another lunitation is Uiat the portable fluorimeter described in tiiis Example to 
measure in vivo tissue fluorescence spectta utilizes a single-pixel probe that 
interrogates a 1 mm diameter area on tiie cervix. Altiiough, the. single-pixel probe that 
tiie inventors have used provides tiie capability to determine whether a small region of 
cervical tissue contains pre-cancerous changes, mapping tiie entire cervk with this 
system is exti^emely time consuming, making wide-scale application of this 
technology impractical. To address tiiis limitation, a multi-pixel probe that can be 
used to acquire fluorescence spectra from multiple sites on the cervix, simultaneously 
may be used. This may provide to a user not only information regarding the presence 
of pre-canccr but can also indicate its location and extent. 

In sununary, in vivo fluorescence specti-oscopy has tiie capability to 
significantly improve the sensitivity of Pap smear screening and the specificity of 
colposcopy in expert hands. Hence, tiiis technique may play an important clinical role 
as a screening / re-screening tool (to screen women who have already had an initial 



wo 99/57529 PCT/US99/09768 

67 .. 

positive Pap smear, but who have not undergone colposcopy and directed biopsy) and 
as an adjunct to colposcopy in expert hands. Advantages realized by using this 
technique include, but are not limited to: (1) screening and diagnostic information 
may be obtained in near real-time and (2) this techmque may be easily automated 
hence reducing the need for subjective interpfcitation. Furthermore, while the Pap 
smear examines only exfoliated cervical epithelial cells, fluorescence spectroscopy 
may interrogate the full thickness of the epithelium. 
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EXAMPLE 3 

Head and Neck Analysis- Fluorescence 

Analysis of fluorescence data collected in a clinical head and neck study has 
been analyzed in accordance with the present disclosure. The Example that follows 
describes analysis of these data. 

Materials and Methods 

Fluorescence excitation envission matrices were measured in vivo from sixty 
two sites in 9 normal volunteers and 11 patients with a known or suspected 
premalignant or malignant oral cavity lesion. Excitation wavelength ranged from 330 
to 500 nm and emission wavelength ranged from 340 to 600 nm. Fluorescence data 
were analyzed to determine which excitation and emission wavelengths contained the 
most diagnostically useful information and to estimate the performance of diagnostic 
algorithms based on this information. Algorithms were developed based on 
combinations of emission spectra at various excitation wavelengths in order to 
determine which excitation wavelengths contained the most diagnostic information. 
Then, at those excitation wavelengths, algorithms were developed based on reduced 
numbers of emission wavelengths to determine whether complete emission spectra 
were required or whether accurate diagnosis could be made using multi-spectral 
measurements at a few excitation/emission wavelength combinations. The algorithm 
development process, consisted of the following steps: (1) data pre-processing to 
reduce inter-patient variations, (2) data reduction to reduce the dimensionality of the 
data set, (3) feature selection and classification to develop algoritimis Which maximize 
diagnostic performance and minimized the likelihood of over-training in a training set, 
(4) unbiased evaluation of tiiese algorithms using the technique of cross-validation. 
Results 

The optimal excitation wavelengtiis for the in vivo detection of oral cancers 
With fluorescence spectroscopy were found to be 350, 380 and 400 nm. An unbiased 
estimate of an algorithm based on the entire emission spectra at these excitation 
wavelengtiis yields a sensitivity of 100% and speciflcity of 88%. Increasing tiie 
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number of excitation wavelengths did not improve algorithm performance. Better 
algorithm performance was obtained when data were normalized to the peak emission 
intensity of the concatenated vector than Ay^^^^^^^ was 
normalized to its own peak emission wavelength. The number of emission 
wavelengths could be significanUy reduced without compromising algorithm : 
performance. When only a single emission wavelength of 472 nm, common to all 
three excitation wavelengths, was used algorithm performance on cross validation was 
90% sensitivity and 88% specificity. The unbiased performance estimate for the 
diagnostic algorithms based on fluorescence spectroscopy have a higher sensitivity 
than cun-ent visual screening techniques done by experts. 
Study Subjects 

9 normal volunteers and 1 1 patients with a known or suspected premalignant 
or malignant oral cavity lesion were recruited to participate in the study at the Head 
and Neck Surgery Clinical at The University of Te'xas M.D: Anderson Cancer Center. 
Written informed consent was obtained from each "person in the study. 
Instrument 

A FastEEM system in accordance with the present disclosure was used for this 
study. Briefly, the system measured fluorescence emission spectra at 18 excitation 
wavelengths, ranging from 330 nm to 500 nm in 10 nm increments. The system 
incorporated a fiberoptic probe, a Xenon arc lamp coupled to a monochromator to 
provide excitation light and a polychroraator and thermo-electrically cooled CCD 
camera to record fluorescence intensity as a function of emission wavelength. 
Calibration 

A background EEM, to be subtracted from the acquired patient data, was 
obtained with the probe immersed in a non-fluorescent bottie filled with distilled 
water at the beginning of each measurement day. Then a fluorescence EEM was 
measured with the probe placed on the surface; of a quartz cuvette containing a 
solution of Rhodamine 610 (Exciton. Dayton, OH) dissolved in etiiylene glycol (2 
mg/mL). 
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To correct for the non-uniform spectral response of the detection system, the 
spectra of two calibrated sources were measured; in the visible an NIST traceable 
calibrated tungsten ribbon filament lamp was used and in the UV a deuterium lamp 
was used (550C and 45D, Optronic Laboratories Inc. Orlando, FL). Correction factors 
were derived from these spectra. Background subtracted EEMs from patients were 
then corrected for the non-uniform spectral response of the detection system. 
Variations in the intensity of the fluorescence excitation light source at different 
excitation wavelengths were corrected using measurements of the intensity at each 
excitation wavelength at the probe tip made using a calibrated photodiode (818-UV, 
Newport Research Corp.), Finally, corrected fluorescence intensiUes from each site 
were divided by the fluorescence emission intensity of the Rhodamine standard at 460 
nm excitation, 580 nm emission. Thus, data illustrated in this paper are not the 
absolute fluorescence intensities of tissue but rather the intensities relative to the 
Rhodamine standard. , . ,.= . 

Data Aquisition '-• 

Before the probe was used it was disinfeicted with Metricide (Metrex Research 
Corp.) in accordance with standard protocol. The probe was then guided into the oral 
cavity and its tip positioned flush with the mucosa. Then fluorescence EEMs were 
measured. 

Fluorescence EEMs were measured from 9 volunteers with no history of oral 
cavity neoplasia at 35 clinically normal sites in the oral cavity (table 1). No biopsies 
were obtained from volunteers. Following visual screening in 11 patients with a 
known or suspected premaligiiant or malignant oral cavity lesion, fluorescence EEMs 
were measured from 27 sites (Table 1). The physician placed the fiber optic probe on 
a lesion or suspected lesion and the fluorescence of that site was measured. In 
addition to the three to five visually abnormal sites, fluorescence EEMs were 
measured from one to three contralateral normal sites. Post-spectroscopy, abnormal 
sites were tattooed with India Ink where the probe measured the spectra. A cUnical 
diagnosis of each lesion as normal, abnormal (not dysplasUc), abnormal (dysplastic) 
or cancerous was recorded by an experienced head and neck surgeon (AMG) or dental 
oncologist (RJ). During follow up surgery, a 2-A nun biopsy of the tissue was taken 
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from the tattooed area. These specimens were evaluated by an experienced 
pathologist (BK) using light microscopy and classified as normal, mucosal reactive 
atypia (MRA). dysplasia or cancer using standard .diagnostic criterion. Biopsies with 
multiple diagnoses were classified according, to the most severe pathological 
diagnosis. The pathologist and clinicians were blinded to the results of the 
spectroscopic analyses. 

Data Review 

A total of 88 sites were measured from 26. subjects. All spectra were reviewed 
by a single investigator blinded to the pathologic results (DLH). Spectra were 
discarded if files were not saved properly due to software error (8 sites), instniment 
error (2 sites), operator error (4 sites), probe movement (3 sites), and the presence of 
room light artifacts at wavelengths below 600 nm (3 sites) in at least one of the 
emission spectra. From the remaining sites, specura from six sites were excluded 
because the tattoo could not be located and consequenUy reliable histologic diagnosis 
was not available for these sites. Therefore, fluoipscence EEMs from 62 sites from 20 
subjects were available for further analysis (Table 1). 
Data Analysis 

Fluorescence data were analyzed to determine which excitaUon and emission 
wavelengths contained the most diagnostically useful informauon and to estimate the 
performance of diagnostic algorithms based on this information. Algorithms based on 
multi-variate discriminant analysis were considered: Algorithms based on 
combinaUons of emission spectra at various excitation wavelengths were developed in 
order to determine which excitation wavelengths contained the most diagnostic 
information. Then, at those excitation wavelengths, spectra based on reduced 
numbers of emission wavelengths were developed to determine whether complete 
emission spectra were required or whether accurate diagnosis could be made using 
multi-spectral measurements at a few excitation/emission wavelength combinations. 

In each case, the algorithm development process, described in detail below, 
included the following major steps: (1) data pre-processing to reduce inter-padent 
variations. (2) data reduction to reduce the dimensionality of the data set. (3) feature 
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selection and classification to develop algorithms which maximized diagnostic 
perfonnance and minimized the likelihood of over-training in a training set, (4) 
unbiased evaluation of these algorithms using the technique of cross-validation. 
Diagnostic Categories 

5 Multi-variate discriminant algorithms were s6ught to separate two Ussue 

categories: normal and abnormal. The abnonnal class contained sites with dysplasia, 
carcinoma in situ and squamous cell carcinoma; the normal class contained sites 
which were clinically and/or histologically normaJ as well as benign changes such as 
inflammation. 

10 Data Pre-Processing 

Fluorescence data from a single measurement site is represented as a matrix 
containing calibrated fluorescence intensity as a function of excitation and emission 
wavelength. Columns of this matrix correspond to emission spectra at a parUcular 
excitaUon wavelength; rows of this matrix correspond; to excitation spectra at a 

15 particular emission wavelength. Each excitation spectrum contains 18 intensity 
measurements; each emission spectrum contains between 50 and 130 intensity 
measurements depending on the excitation wavelength. Most multi-variate data 
analysis techniques require vector input rather than matrix input, so the column 
vectors containing the emission spectra at excitation wavelengths selected for 

20 evaluation were concatenated into a single vector in order to explore which excitauon 
wavelengths contained the most diagnostic information. 

Our previous work illustrated that spectra of oral cavity obtained /« viVo show 
large patient to patient variaUons in intensity that can be greater than the iriter- 
category differences. Therefore, the inventors explored pre-processing methods to 

25 reduce the inter-patient variations, while preserving inter-category differences. While 
many different methods of pre-processing are possible, two methods were selected for 
evaluation here: (1) noraialization of all emission spectra of a given excitation 
wavelength combinaUon to the maximum intensity contained within that combination, 
and (2) normalization of each emission specu-a to its maximum intensity. 
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Reduction of Excitation Wavelength Number 

In this study, fluorescence emission spectra were measured at 18 different 
excitation wavelengths: One goal of data analysis was to determine which 
combination of excitation wavelengths contains the most diagnostic information. The 
inventors considered combinations of up to foiir emission speeds Limiting the 
number of wavelengtiis to four allows for construction of a reasonably cost-effective 
clinical spectroscopy system. Two strategies were considered to identify the optimal 
wavelength combination. The first was to identify the single wavelength which gives 
tiie best diagnostic perforaiance, then the wavelength of those remaining that most 
improves diagnostic performance, and so fortii until performance no longer improves 
or four wavelengths have been selected. The second method was to evaluate all 
possible combinations of up to four wavelengths chosen from die 18 possible 
excitation wavelengUis. This equates to 18 combinations of one. 153 combinations of 
two. 816 combinations of three, and 3.060 combinations of four excitation 
wavelengths, for a total of 4.047 combinations,.,While. the first metiiod requires less 
computational time, it is only appropriate for normalization methods tiiat remove 
relative intensity information. Otiierwise. the bestsingle wayelengtii may not be part 
of the best wavelength pair that exploits differences in relative intensity. The second 
method can be used witii either normalization scheme and in addition, provides a tool 
to rank the top wavelength combmations. rather tiian identifying tiie single best 
wavelength combination, so Uiis method was pursued. 
Algorithm Development 

For each of the 4,047 combinations of one to fbiir excitation wavelengtiis, 
spectra from the entire data set were used as a training set to develop multi-variate 
algorithms to separate nomial and abnormal tissues based on their fluorescence 
emission spectra at all possible wavelength combinations. Algoritiun development 
included of three steps: (1) pre-processmg, (2) data reduction and (3) development of 
a classification algorithm which maximized diagnostic performance. Data were pre- 
processed using the two normalization schemes described above. For each 
normalization, principal component analysis was performed using the entire dataset 
and eigenvectors accounting for 65. 75. 85. and 95% of tiie total variance were 
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retained. Principal component scores associated with these eigenvectors were 
calculated for each sample. Discriminant functions were then formed to classify each 
sample as normal or abnormal. The classificaUon was based on the Mahalanobis 
distance, which is a multivariate measure of the separation of a point from a dataset in 
n-dimensional space. Each sample was held out one at a time and the Mahalanobis 
distances between to the held out sample and the remaining normal and abnormal 
samples were calculated; the sample was classified according to the category 
corresponding to the smallest distance. The ; sensitivity and specificity of the 
algorithm were then evaluated relative to diagnoses based on histopathology (in 
patients suspected to have oral cavity malignancy) or clinical impression (in normal 
volunteers). Overall diagnostic performance was evaluated as the sum of the 
sensitivity and the specificity, thus minimizing the number of misclassifications 
(when prevalence of disease and normal are approximately equal). The performance 
of the diagnostic algorithm depended on the priRcipal component scores which were 
included. Four different diagnostic algorithms v^ere developed using principal 
component scores derived from eigenvectors accounting for increasing amounts of 
total variance. From tiie available pool of .principle component scores, the single 
principal component score yielding the best initial performance was identified, and 
then the principal component score that most improved this performance was selected. 
This process was repeated until performance is no longer improved by the addition of 
principal components scores, or all available scores wpre selected. The pool of 
available eigenvectors is specified by a variance criterion, eigenvector significance 
level (ESL). tiiat represents the ininimum variance fraction accounted for by the sum 
of die n largest eigenvalues. In this work the inyentors examined 4 ESLs, 
corresponding to 65%. 75%. 85% and 95% of die total variance 
Comparing Performance of Various Excitation Wavelength Combinations 

At each ESL. the wavelengtii combinations were ranked in order of decreasing 
perfomiance. based upon the sum of sensitivity and specificity. The combinations 
were ranked and evaluated based upon training performance. However, as the ESL 
approaches 100%. over-training becomes more 'likely, since the available pool of 



wu»»/S75zy PCTAJS99/09768 

75 • 

eigenvectors will account for nearly 100% of the variance, including variance due to 
noise. The magnitude of diagnostically important variances is unknown. 

The risk of over-training risk was assessed at the top 25 wavelength 
combinations of two, three, and four excitation wavelengths, by comparing the 
training set performance to the performance of an algorithm developed from the same 
data after the diagnoses corresponding to each measurement site had been 
randomized. This provides a dataset with the same variance structure as the original 
dataset, but where the diagnostic performance is not expected to exceed that of 
chance. In order to make equivalent comparisons, the disease prevalence in the real 
sample was maintained in the randomly assigned diagnoses. Diagnostic algorithms 
were then developed again which minimized the number of misclassified samples at a 
specified eigenvector significance level (ESL). Random diagnoses were assigned fifty 
times for each wavelength combination and the average and standard deviation of the 
sum of the sensitivity and specificity were calculated. Ideally, for completely 
normally distributed data, the sum of the sensitivity and specificity should be one for 
the randomized diagnosis at all levels of training significance. However, if over- 
training occurs, this sum will be greater than, one.. The top 25 wavelength 
combinations were then ranked again based on the absolute difference between the 
training set performance and random diagnosis assignment. This method allows the 
top wavelength combinations to be ranked in order of their robustness, or lack of 
propensity to over-train. For a given number of wavelengths per combination, the 
differences were ranked across all four eigenvector significance levels. The largest 
difference, usually seen at ESL values of 65%, >yas selected as the optimal wavelength 
combination. This criterion selects the wavelength combination that is least prone to 
over-training. 

Validation of Algorithm Performance 

Although the optimal wavelength combination has been identified based upon 
comparison of its performance to that which can be achieved when the tissue 
diagnoses have been randomized, our estimates of algorithm performance are still 
biased since they are based on the same u-aining set used to develop the algorithm. An 
unbiased performance estimate must be made to assess the Due potenUal of this 
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wavelength combination. The effects of over-training in performance estimation can 
be minimized by using separate training and validations sets, or by using the method 
of cross-validation. The data set here was not sufficiently large to divide into separate 
training and validation sets, therefore the inventors used the cross-validation method. 
In this method, all data from one patient are temporarily removed from the data set. 
the algorithm is developed using the remaining data set. and then the new algorithm is 
applied to the left out sites. This is repeated until data from each patient has been left 
out once. Cross validation was used to provide an unbiased estimate of tiie 
performance of the top three combinations of excitation wavelengtiis with each 
normalization. 

Reduction of Emission Wavelength Number 

The inventors investigated whether effective diagnostic algorithms could be 
developed using reduced numbers of emission wavelengths at the top performing 
excitation wavelengUi combinations. The inventors calculated the component 
loadings associated with tiie eigenvectors corresponding to the principal component 
scores selected in these algorithms. A component loading represents the correlation 
between each principal component and the original pre-processed fluorescence 
emission spectra at each excitation wavelength: The component loadings at each 
excitation wavelength were evaluated to select fluorescence intensities at a minimum 
number of excitation-emission wavelength pairs required for the algorithms to 
perform wiUi a minimal decrease in classification accuracy. Portions of Uie 
component loadings most highly correlated (correlation >0.5 or <-0.5) with 
corresponding emission spectia at each excitation wavelengtii were selected and tiie 
reduced data matrix was then used to regenerate and evaluate the algorithms 
Results 

Fluorescence EEMs from 62 sites from 20 subjects were available for furtiier 
analysis (Table 1). Of these 62 sites. 37 were measured from tiie tongue, eight from 
the floor of mouth (FOM). seven from the buccal mucosa, four from tiie gingiva, one 
from the palate, and five from the lip. There were 52 normal, four dysplastic. and six 
cancerous sites. The data set consisted of two types of normal sites: adjacent normals 
and normals from a population without oral cancer. Adjacent normals are the visually 
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normal sites taken from paUents that have suspected lesions elsewhere in the oral 
cavity. In this data set there were 17 adjacent hormal (histologically normal) sites 
from eleven patients, and 35 visually noraial sites taken from nine patients. 

The visual screening accuracy of the head and neck physicians for this data set 
was 100% sensitivity and 83% specificity. This performance was determined by . 
comparing the visual impressions of the clinicians to the histologic findings upon 
excision. Results of the analysis of the spectroscopic data are presented according to 
the normalization method used. 

Normalization by peak emission intensity of the concatenated vector 

The top 25 combinations of one to four excitation wavelengths were ranked in 
order of the largest difference in the sum of the sensitivity and the specificity in the 
training set and the average perfomiance with randomly assigned diagnoses. The top 
3 combinations correspond to the following excitation wavelength combmations: (350 
380 400 480). (350 380 400 490). and (350 380 400). All of these combinations 
demonstrate approximately the same training set performance, with 100% sensitivity 
and 90% specificity. These combinations have three wavelengths in conmion. Since 
no performance benefit was observed when a fourth wavelength was added for the top 
performing combinations, combinations of four wavelengths were not pursued any 
further. The top 25 combinations of three excitation wavelengths, ranked in order of 
the largest difference in the sum of the sensiUvity and the specificity in the training set 
and the average performance with randomly assigned diagnoses are given in Table 2. 
The ranking of each combination based upon training set performance is given as 
well. Table 2 gives the diagnostic performance of each combination for both the 
training set and the average performance for the data set with randomized diagnosis. 
The random diagnosis performance demonsu-ated that the combinaUons showed 
varying propensities to over-train. 

A histogram depicting the frequency at which each wavelength appeared in the 
top 25 combinations fi-om Table 2 is shown in Figure 34 for various ESLs. At low 
ESL values of 65%. 75% and 85% the diagnostic importance of excitaUon at 350. 
380. and 400 nm is evident. This is seen in the histograms for wavelength 
combinaUons of two and four as well (data not shown). 
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To provide an unbiased estimate of performance of these algorithms, the 
diagnostic performance of the top wavelength combinations was evaluated by using 
the method of cross-validation using the foil data set. The wavelength combination 
(350. 380. 400 nm) demonstrated a cross validation performance of 100% sensitivity 
and 88% specificity. The other two combinations (350, 380, 400, 480 nm) (350, 380, 
400. 490 nm) demonstrated identical perforaiance upon cross validation with a 
sensitivity of 100% and a specificity of 90%. 

The emission spectra corresponding to all 62 sites at the three excitation 
wavelengths common to these combinations are shown in Fig. 36. Visual 
examination of Fig. 36 confirms the diagnostic potential of Uiis wavelength 
combination. The identified combinations demonstrate the importance of the relative 
intensities as seen foUowing normalization to the maximum intensity in the 
concatenated emission vector. With tiiis normalization, the normal sites demonstrate 
greater fluorescence intensity at 380 nm excitation. ,450 nm emission than the 
abnormal sites. Additionally, tiie remaining emission peaks tend to be more intense in 
normal sites than for abnormal sites m most instances. The normal sites misclassified 
as abnomial are easily seen in Figure 36. Uistplpgically. these sites demonstrated 
increased vascularity, suggesting- tiiat tiie increased hempglobui absorption is one 
cause of the reduced relative fluorescence intensity from these, sites. 

The algorithm based on the combination of 350, 380 and 400 nm excitation 
wavelengths selected only a single principal component score, associated with the 
eigenvector that accounted for most of the total variance, Figure 37 shows this 
eigenvector and the associated component loading. The eigenvector depicts die 
general hneshape of the normalized spectra shown in Figure 37. The component 
loading shows tiiat die principal component score for tiiis eigenvector is highly 
correlated to approximately four regions of the concatenated emission vector. Single 
emission intensities within these ranges were selected arbitrarily and are denoted as 
soUd green cucles in Figure 37. These points correspond to tiie emission intensities of 
418 and 470 nm at 350 nm excitation, 448 nm emission at 380 nm excitation, and 
502 nm emission at 400 nm excitation. An algorithm was developed using tiie same 
data reduction and classification methods as above based upon this reduced data set. 
The training performance of the reduced algorithm is 100% sensitivity and 90% 
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specificity, and the cross-validated performance is 90% sensiUvity and 909b specificity 
compared to 100% sensitivity and 88% specificity of for the algorithm based on the 
entire emission spectra. This algorithm uses a higher ESL of 95% since the reduced 
data set contains less variance introduced by noise. Motivated by the desire to 
construct a simple device that could interrogate or image large areas of tissue, a 
reduced algorithm based upon a single emission wavelength was evaluated. The 
emission wavelength chosen was common to aU three emission spectra, 472 nm. The 
training performance of this reduced algorithm was 100% sensitivity, 88% specificity, 
and upon cross validaUon it was 90% sensitivity and 88% specificity. 
Normalization of each emission spectra by its peak emission prior to concatenation 

The analysis was repeated using concatenated vectors in which each emission 
spectrum was normalized to its peak intensity. This method removes relative intensity 
information and relies on differences in fluorescence lineshape. The maximum 
difference between training performance and the perfonnance after random diagnosis 
assignment was 0.58 compared to 0.82 using the other normalization method. 
Consequently, the top wavelength combination identified (350, 380, 400, 430 nm) 
showed poor performance upon cross-validation with a sensiUvity of 50% and a 
specificity of 88%. It is interesting to note that the previously identified wavelengths, 
(350, 380. 400 nm) are also a part of this combination, indicaUng that the line shape at 
these wavelengths contains diagnostic information. 
Discussion and Conclusions 

This Example identified the optimal excitation wavelehgths for in vivo , 
detection of oral cancers with fluorescence spectroscopy. The optimal excitation 
wavelengths were found to be 350. 380 and 400 nm. An unbiased estimate of an 
algorithm based on the entire emission spectra at these excitation wavelengths yields a 
sensitivity of 100% and specificity of 88%. Increasing the number of excitation 
wavelengths did not improve algorithm performance. Better algorithm performance 
was obtained when data were normalized to the peak emission intensity of the 
concatenated vector than when each emission spectrum was normalized to its own 
peak emission wavelength. The discriminating ability of this wavelength combination 
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is due to differences in both relative intensity and spectral Une shape. The number of 
emission wavelengths could be significantly reduced as well without compromising 
algorithm performance. An algorithm based on four emission intensities: 418 and 470 
nm at 350 nra excitation. 448 nm emission at 380 nm excitation, and 502 nm emission 
at 400 nm excitation yielded 90% sensitivity and 90% specificity upon cross- 
validation. When only a single emission wavelength of 472 nm, common to all three 
excitation wavelengths, was used algoritfmi performance on cross validation was 90% 
sensitivity and 88% specificity. 

The unbiased performance estimate for the diagnostic algoriUmis based on 
fluorescence spectroscopy have a higher sensitivity Uian current visual screening 
techniques done by experts. In tiieir hands, visual screening has been reported to have 
a sensitivity of 74% and specificity of 99%. The performance of visual screening by 
experts in this study was 100% sensitivity, 83% specificity. 

It is interesting to note that emission specU-a;pbtaine4 at 400 nm excitation are 
included in a majority of the top combinations. Hemoglobin has a strong absorption 
maximum near this location, suggesting diat differences in absorption due to perfiasion 
may offer diagnostic information. This suggests that tiie. combinations of reflectance 
and fluorescence spectroscopy may offer improved diagnostic performance. 
Head and Neck Analysis- Reflectance ' 

A FastEEM system was also used to measure tissue reflectance spectra over 
the visible region of tiie spectrum at tiiree source-detector fiber separations. The 
inventors have analyzed these data with at least two goals: (1) to determine the 
diagnostic potential of reflectance spectroscopy for detection of neoplasia 
cavity, and (2) to determine the combined diagnostic potential of fluorescence and 
reflectance specu-oscopy for detection of neoplasia of tiie oral cavity. 
Study Design 

9 normal volunteers and 1 1 patients with a known or suspected premalignant 
or malignant oral cavity lesion were recruited to participate in the study at tiie Head 
and Neck Surgery Clinical at The University of Texas M.D. Anderson Cancer Center. 
Written inforaied consent was obtained from each person in tiie study. 
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Instrument 

The spectroscopic system used to measure reflectance spectra has been 
described in detail previously and is briefly summarized here. It includes of a Xenon 
arc lamp and a 295 mn long-pass filter which pro vides broadband illumination, a fiber 
optic probe which directs light to the tissue and collects diffusely reflected light from • 
three locations (position 1. position 2. position 3). and an imaging spectrograph and 
CCD which detects the reflected light intensity as a function of wavelength. Fibers for 
iUmnination and collection of diffuse reflectance are arranged in a ring at the edge of 
the probe. The collection fibers are located 1.1, 2.1 and 3 mm from a single 
iUmnination fiber. All fibers have a core diameter of 200 microns. White light from 
the Xe lamp is coupled to tiie proximal end of the illumination fiber. The distal ends 
of the fibers are flush with the probe tip and placed in direct contact with the sample 
surface. Using tiiis system, oral cavity tissue reflectance spectra from 390-590 mn 
with a spectral resolution of 4 nm were collected in approximately 30 seconds. The 
signal to noise ratio exceeded 75: 1 for 90% of the data. 
Procedure 

Reflectance spectra were wavelength calibrated with a mercury light source. 
Dark current and background were recorded befpre each measurement with the same 
settings but with illumination turned off. These background measurements were 
subtracted from each reflectance measurement offline. Reflectance data are reported 
relative to a 2.68% by volume solution of 1.072 micron diameter polystyrene 
microspheres (Polyscience Inc., Wanrington. placed on die 

outside wall of a 1 cm path length cuvette containing the microsphere solution. The 
total integrated reflectance of this standard was measured on a double beam 
spectrophotometer (U-3300 Hitachi. Tokyo. Japan) witii an integrating sphere 
attachment (I^bsphere Inc.. North Sutton. NH). This v,as used to correct tiie 
reflectance measurements of the microsphere solution made with the spectroscopic 
system. Tissue spectra at each collection fiber position were divided pointwise by the 
corrected standard reflectance spectmm at the corresponding fiber position. 

Before the probe was used it was disinfected witii Metricide (Metrex Research 
Corp.) in accordance with standard protocol. The probe was then guided into the oral 
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cavity and its tip positioned flush with the mucosa. Then reflectance spectra were 
measured. 

Reflectance spectra were measured from 9 volunteers with no history of oral 
cavity neoplasia at 35 clinically noraial sites in the oral cavity (see Table 3). No 
biopsies were obtained from volunteers. Following visual screening in 11 patients 
with a known or suspected premalignant or malignant oral cavity lesion, reflectance 
spectra were measured from 27 sites. The physician placed the fiber optic probe on a 
lesion or suspected lesion and the reflectance of that site was measured. In addition to 
the three to five visually abnormal sites, reflectance spectra were measured from one 
to three contralateral normal sites. Post-spectroscopy. abnormal sites were tattooed 
with India Ink where the probe measured the spectra. A clinical diagnosis of each 
lesion as normal, abnormal (not dysplastic). abnormal (dysplastic) or cancerous was 
recorded by an experienced head and neck surgeon (AMG) or dental oncologist (RJ). 
During follow up surgery, a 2-4 nmi biopsy of the.tissue. was taken from the tattooed 
area. These specimens were evaluated by an experienced pathologist (BK) using light 
microscopy and classified as normal, mucosal reactive atypia (MRA). dysplasia or 
cancer using standard diagnostic criterion. Biopsies with mulUple diagnoses were 
classified according to the most severe pathological diagnosis. The pathologist and 
clinicians were blinded to the results of the spectroscopic analyses. 
Data Analysis ; • ~ 

Reflectance spectra were fiirther processed to reduce noise. A moving average 
with a width of 10 nm was applied to each specimm; following this, intensities of all 
reflectance spectra were exacted in 5 nm steps fi^^ 

analyzed. In addition, the first (slope) and second derivatives of the reflectance spectra 
were calculated between 400 and 580 nm in 5 nm' steps. 

An exploratory data analysis was carried out to determine which source- 
detector separations and wavelength regions were usefol to separate three tissue 
categories: normal, dysplasia and cancer. The normal class' contained sites which were 
clinically and/or histologically normal as well as benign changes such as 
inflammation. 
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For each diagnostic category (normal, dysplasia, cancer) the inventors 
calculated the average value and standard deviaUon of the intensity at each 
wavelength, and the first and second derivative at each wavelength. These values were 
calculated separately for each source detector separation. The Student's t-test was 
used to determine whether differences in these mean values were statisticaUy 
significant between groups of two categories. The inventors examined nomial tissues 
vs. abnormal tissues (dysplasia and cancer) as well as normal tissues vs. dysplasia. 

Parameters which were most statistically significant, corresponding to the 
lowest p-values. were examined further for diagnostic ability. The inventors 
consmicted two-dimensional scatter plots which showed the most statistically 
significant parameter values for each site measured to determine which parameters 
could most effectively discriminate between the two categories of normal and 
abnormal (dysplasia and cancer). All calculations and graphs were produced with the 
Matlab® (Mathworks Inc.) and the Statistical Toolbox for Matlab. 
Results 

Figures 38 through 40 show the reflectance spectra, first and second derivative 
at each of the three source detector separations for all sites measured. Figures 41 
tiu-ough 43 show the average value plus and minus one standard deviation for normal, 
dysplastic and cancer sites. Normal sites are shown iri green, dysplasia in blue and 
cancer in red. In general. Uie specU-a of cancer sites show the highest reflectance 
intensity at all wavelengths measured, while specti-a of normal and dysplastic sites are 
lower in intensity and more similar. Differences in intensity are greatest at position 1 
and least at position 3. The slope and second derivative of the reflectance spectra are • 
greater (lower) for cancers at 440 and 480 nm (520 nm). 

Figure 44 shows the p values comparing the mean intensity, mean first and 
second derivatives of normal tissue versus abnormal tissues, at each wavelength at the 
three different source detector separations. Figure 45 shows tiie p values comparing 
the mean intensity, mean fu^t and second derivatives of normal tissue versus 
dysplastic tissues, at each wavelengtii at the three different source detector 
separations. A low value indicates a statisticaUy significant result; the inventors are 
panicularly interested in tiiose witii values less tiian 0.05. 
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At each source-detector fiber separation, the inventors raniced the intensity, 
first and second derivatives at each wavelength in order of increasing p-value. Tables 
4-6 show the results when normal and abnormal tissues were compared. Tables 7-9 
show the results when normal and dysplastic tissues were compared. Results are 
shown for p- values less than or equal to 0.05. 

In order to explore the diagnostic contributidns provided by these wavelength 
regions, the inventors highlighted all regions where the p-value was less than or equal 
to 0.01 for first and second derivatives and less than or equal to 0.02 for intensity. 
These values are highlighted in gray in tables 4-9. This resulted in a total of 15 
different parameters. The slope and second derivaUve near 440-460 nm at positions 1 
and 2 were identified as diagnostically useful regions, as was the slope and second 
derivative near 500-510 nm at position 3. The intensity from 450-51 nm and 570-585 
nm at position 2 were also identified as diagnostically usefiil. 

Two dimensional scatterplots containing ^1 possible, pairwise combinations of 
these 15 groups of parameters were generated (105 total combinations). Figures 46- 
48 show three representative examples. Figure 46 shows the second derivative at 430 
nm for position 2 vs. the second derivative at 495 nm for position one. The straight 
line represents an algorithm to separate normal findings from dysplasias and cancers, 
and results in a sensitivity of 80% and a specificity of 85%. Figure 47 shows the 
second derivative at 450 nm for position 1 vs.- the first derivative at 510 nm for 
position tiiree. The straight line represents an algorithm; to separate normal findings 
fi-om dysplasias and cancers, and results in a sensitivity of 80% and a specificity of 
82%. Figure 48 shows the second derivative at 410 nm for position 1 vs. tiie first 
derivative at 510 nm for position three. The sti^ght line represents an algorithm to 
separate normal findings from dysplasias and cancers, and results in a sensitivity of 
70% and a specificity of 75%. In each case, the lines were drawn to minimize the 
total number of samples misclassified. These sensitivity and specificity values are 
slighUy lower than tiiose achieved in the previous; section using fluorescence alone, 
and reflect the greater overiap in the reflectance of tissues from the three groups than 
is seen in Uie fluorescence spectra. However, the fluorescence algorithms were based 
on multi-variate classifiers to enable the use of more than two parameters in the 
algorithm. These techniques were next pursued using reflectance spectra. 



W099/57529 PCr/US99/09768 

85 

Multi-Variate Discriminant Algorithms 

Reflectance spectra were analyzed to determine which wavelength ranges and 
source-detector fiber separations contained the rnost diagnosticalty 
and to estimate the perfonnance of multi-variate diagnostic algorithms based on this 
information. The inventors considered algorithms based on multi-variate discriminant 
analysis. First, the inventors developed algorithins based on reflectance spectra, or 
their first or second derivatives over various '^avelength ranges at each source- 
detector fiber separation in order to determine which types of spectra, wavelength 
ranges and fiber separations contained the most diagnostic information. In addition, 
tiie inventors developed algoritimis using the concatenated spectra (or then- first or 
second derivatives) at all fiber separations over various wavelength ranges. In each 
case, the algorithm development process, described in detail below, consisted of the 
following major steps: (1) data reduction to reduce the dimensionality of tiie data set. 

(2) feature selection and classification to develop algorithms which maximized 
diagnostic performance and minimized the likeUhood of over-training in a training set. 

(3) unbiased evaluation of tiiese algorithms using tire technique of cross-validation. 
Diagnostic Categories ' ■■ - 

Multi-variate discriminant algoritiirhs were sought to separate two tissue 
categories: normal and abnormal. The abnormd class conUined sites wiUi dysplasia, 
carcinoma in situ and squamous cell carcinoma; the nomial class contained sites 
which were clinically and/or histologically normal as well as benign changes such as 
inflammation. 

Algorithm Development 

For each of the different types of spectra and wavelengtii ranges, spectra from 
the entire data set were used as a training set to develop multi-variate algorithms to 
separate normal and abnormal tissues based on their reflectance. Algoritimi 
development included two steps: (1) data reduction and (2) development of a 
classification algorithm which maximized diagriostic performance. For each type of 
data, principal component analysis was performed using the entire dataset and 
eigenvectors accounting for 65. 75. 85. 95% and 99% of the total variance were 
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retained. Principal component scores associated with these eigenvectors were 
calculated for each sample. Discriminant functibiis were then fonned to classify each 
sample as normal or abnormd. m classificafe^ 

distance, which is a multivariate measiffe of the separation of a point from a dataset in 
n-dimensional space. Each sample was held out one at a time and the Mahalanobis 
distances between to the held out sample and the remaining normal and abnormal 
samples were calculated; the sample was classified according to the category 
corresponding to the smallest distance. The sensitivity and specificity of the 
algorithm were then evaluated relative to diagnoses based on histopathology (in 
patients suspected to have oral cavity malignancy) or clinical impression (in normal 
volunteers). Overall diagnostic performance was evaluated as the sum of the 
sensitivity and the specificity, thus minimizing the number of misclassifications 
(when prevalence of disease and normal are approximately equal). The performance 
of the diagnostic algorithm depended on the principal component scores which were 
included. Five different diagnostic algorithms were developed using principal 
component scores derived from eigenvectors accounting for increasing amounts of 
total variance. From the available pool of principle :Component scores, the single 
principal component score yielding the best initial performance was identified, and 
then the principal component score that most improved this performance was selected. 
This process was repeated until performance was no longer improved by the addition 
of principal components scores, or aU available scores were selected. The pool of 
available eigenvectors is specified by a variance criterion, eigenvector significance 
level (ESL), that represents the mimmum variance fraction accounted for by the sum 
of the n largest eigenvalues. In this work the inventors examined 5 ESLs, 
corresponding to 65%, 75%. 85%, 95% and 99% of the total variance. 
Comparing Performance of Various Data Types and Wavelength Ranges 

At each ESL. wavelength range and type of data the inventors calculated the 
sum of sensitivity and specificity. As the ESL approaches 100%. over-training 
becomes more likely, since the available pool of eigenvectors will account for nearly 
100% of the variance, including variance due to noise. The magnitude of 
diagnostically important variances is unknown. The risk of over-training risk was 
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assessed for each of the types of input data, by cornparing the training set performance 
to the performance of an algorithm developed from the same data after the diagnoses 
corresponding to each measurement site had been randomized. This provides a 
dataset with the same variance stiucture as the originki dataset. but where the 
diagnostic performance is not expected to exceed that of chance. In order to make 
equivalent comparisons, the disease prevalence in the real sample was maintained in 
the randomly assigned diagnoses. Diagnostic algorithms were then developed again 
which minimized the number of misclassifiecl.. samples at a specified eigenvector 
significance level (ESL). Random diagnoses were assigned fifty times for each 
wavelength combination and the average and standard deviation of the sum of the 
sensitivity and specificity were calculated. Ideally, for completely normally 
distributed data, the sum of the sensitivity and specificity should be one for the 
randomized diagnosis at all levels of training significance. However, if over-training 
occurs, this sum will be greater than one. At each 3$L, wavelength range and type of 
data the inventors calculated the absolute difference between the training set 
performance and random diagnosis assignment. This method allows the best types of 
data and wavelength ranges to be identified based on their robustness, or lack of 
propensity to over-train. Unlike our. analysis of the fluorescence from oral cavity, in 
this case, all sensitivity and specificity values were calculated for the case of cross- 
validaUon. This proved to be necessary since, the eigenvectors which contained 
diagnostically useful information contributed a relatively smaller amount of the total 
variance for reflectance than for fluorscence. The largest differences, were selected as 
the optimal data type and wavelength range. This- criterion selects the data type and 
wavelength range that is least prone to over-training. 
Results - Multi-Variate Discriminant Algorittims 

Tables 10-12 show the absolute difference between the training set 
performance and random diagnosis assignment for the differem data types, 
wavelength ranges and ESLs. The inventors selected an improvement of 0.5 as 
significant for first and second derivative data and an improvement of greater than 0.4 
as significant for intensity data (since this is easier to measure in a mulU-spectral 
imaging system). Wavelength ranges, data types and ESLs with at least this 
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improvement are highUghted in Tables 10-12. Eight types of data met these ctiteria; 
however, the wavelength range associated with several of them overlapped 
significantly. In this case, the combination with the best performance increase was 
selected. «.sulting in the fbUowing four combinations: (1) Jntehsity at position 2 from 
395-475 mn at 95% ESL. (2) Intensity at positions 1-3 from 425-500 mn at 99% ESL. 
(3) Slope at position 1 from 450-525 nm at 65% ESL and (4) Slope at position 3 from 
395-550 nm at 95% ESL. Table 13 gives the cross-vaUdated sensiUvity and 
specificity for algorithms based on these data types, wavelength ranges and ESLs. 
The best performance was achieved using the slope at position 3 from 395-550 nm at 
95% ESL. with a cross-validated sensitivity of 70% and a specificity of 100%. This 
compares favorably to tiie scatter plot shown in Figure 47. which shows the second 
derivative at 450 nm for position 1 vs. the slope at 510 nm for position three, where a 
simple linear discriminant algorithm resulted in a sensitivity of 80% and a specificity 
of 82%. 

Head and Neck Analysis- Combination cf Fluorescence and Reflectance 

In general, the performance of multi-variate' algorithms based on reflectance 
spectroscopy alone was somewhat lower tiian that b^ed on fluorescence spectroscopy 
alone. However, from an instrumentation point of view, it may be easier to measure 
reflectance images and spectra since signal to noise ratio is higher. Therefore, the 
inventors explored tiie combination of reflectance and fluorescence spectroscopy and 
wheter it may provide better discrimination. Fuithfer. tiie inventors examined whether 
the good performance of the fluorescence algorithm may be maintained if ti.e number 
of fluorescence excitation wavelengths were itduced/ but ^ectance spectra were V 
measured. 

In our previous analyses, the inventor identified a combination of emission 
spectra at three excitation wavelengtit as optimal for diagnosis based on fluorescence 
spectroscopy and four types of reflectance data which were optimal for diagnosis. The 
inventors evaluated the performance of tiie following combinations of data at ESLs of 
65%. 75%. 85%. 95% and 99%: (a) Fluorescence at three excitation wavelengths + 
each type of reflectance data, (b) Fluorescence at all combinations of two excitation 



wo 99/57529 PCT/US99/09768 
89 

wavelengths + each type of reflectance data, and (c) Fluorescence at each single 
excitation wavelength + each type of reflectance data. 

The performance of these combinations was compared to that which could be 
achieved with fluorescence alone. Since the qurnber of samples where both 
5 fluorescence and reflectance data were available was smaller than that for either type 
of data alone, the inventors re-evaluated the performance of algorithms based on 
reflectance or fluorescence data alone using this reduced dataset. The inventors also 
evaluated the performance of fluorescence alone at one or two excitation wavelengths 
using this reduced dataset. Table 14 shows the number of patients and sites where 

10 both reflectance and fluorescence data were available. Results, reported as sensitivity 
and specificity giving best performance under cross vaUdation, are shown in Tables 
15-18 for each type of reflectance data. 

The performance of the fluorescence algorithm based on three excitation 
wavelengths does not improve when any of the four types of reflectance data are also 

15 incorporated. The performance of fluorescence algorithms based on two excitation 
wavelengths was lower than that for three excitation wavelengths; incorporation of 
any of the four types of reflectance spectra^.did not improve performance. The 
performance of fluorescence algorithms based ph a single excitation wavelength was 
lower than that for two and three excitation wavel.engths. Best results were obtained 

20 using spectra at 400 nm excitation. Incorporation of any of the four types of 
reflectance specu-a did not improve performance. 

All of the methods and apparatus disclosed and claimed herein can be made 
and executed Avithout undue experimentation in light of the present disclosure. While 
the apparatus and methods of this invention have been described in te^ 

25 embodiments, it will be apparent to those of skill in the art that variations may be 
applied to the methods and/or apparatus described, herein without departing from the 
concept, spirit and scope of the invention. 
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1 . An apparatus for performing fluorescence and spatially resolved reflectance 
spectroscopy on a sample, comprising: 

5 a light source; 

a monochromator in optical commimication with said light source; 
a reflectance illumination fiber in optical communication with said light 
source; 

a fluorescence excitation fiber in optical communication with said 
10 monochromator, 
an imaging spectrograph; 

a fluorescence collection fiber in optical conununication with said imaging 
spectrograph; 

a reflectance collection fiber in optical communication with said imaging 
15 spectrograph and in spaced relation with said reflectance illumination 

fiber; and 

and a detector in optical communication with said imaging spectrograph. 

2. The apparatus of claim 1, wherein said light source comprises a Xe arc lamp. 

20 

3. The apparatus of claim 1, wherein said monochromator comprises a double 
monochromator. 

4. The apparatus of claim 1 , wherein said detector comprises a thermo-electricaliy 
25 cooled CCD camera. 

5. The apparatus of claim 1, wherein said fluorescence excitation fiber and said 
fluorescence collection fiber are integral. 

30 6. The apparatus of claim 1 , wherein one or more of said fibers are positioned flush 
with said sample. 
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7. The apparatus of claim 1 , further comprising a spacer positioned between one or 
more of said fibers and said sample. 

8. The apparatus of claim 1 , wherein said reflectance illumination fiber, said 
fluorescence excitation fiber, said fluorescence collection fiber, and said reflectance 
collection fiber define a fiber optic probe. 

9. The apparatus of claim 8. wherein said probe is configured to be positioned within 
a trocar. 

10. The apparatus of claim 8, wherein said probe comprises a center section and an 
outer section, said fluorescence excitation fiber and said fluorescence collection fiber 
being positioned in said center section, and said reflectance illumination fiber and said 
reflectance collection fiber being positioned in said outer section. 

1 1 . The apparatus of claim 1 , comprising a plurality of fluorescence excitation and 
coUection fibers arranged in a circular bundle. : .. . . , , , : 

12. The apparatus of claim 1, comprising a plurality of reflectance collection fibers 
defining a plurality of collection positions. 

13. The apparatus of claim 12, wherein said plurality of collection positions are 
spaced between about 0 and about 10 millimeters from said reflectance illumination 

■■■fiber.\ /■ 

14. The apparatus of claim 1, wherein said reflectance collection fiber defines a 
collection position at about 180 degrees relative to said reflectance illumination fiber. 

15. The apparatus of claim 1. wherein said reflectance collection fiber defines a 
collection position at about 90 degrees relative to said reflectance iUumination flber. 
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16. The apparatus ofclaiml, wherein said reflectance collection fiber defines a 
collection position at about 45 degrees relative to said renectance illuminaUon fiber. 

17. The apparatus of claim 1, further comprising a one or more fibers in optical 
communication with said light source and configured to illuminate said sample during 
operation of said apparatus. 

18. The apparatus of claim 1, comprising a plurality of fluorescence excitation fibers 
arranged in one or more rows adjacent said monochromator. 

19. The apparatus of claim 1. comprising a plurality of fluorescence excitation fibers 
and a plurality of reflectance collection fibers arranged in a single row adjacent said 

. imaging spectrograph. 

20. The apparatus of claim 19, further comprising one or more unconnected fibers 
interspersed with said plurality of fluorescence excitation fibers and said plurality of 
reflectance collection fibers. 

21. The apparatus of claim 1, ftirther comprising a fiber connected from said light 
source to said imaging spectrograph to monitor spectral output of said Ught source. 

22. The apparatus of claim 1, further comprising a controller coupled to said detector. 

23. An apparatus for measuring fluorescence and spatially resolved reflectance 
spectra of a sample, comprising: 

a light source; 

a monochromator in optical communication with said light source; 

a fiber optic probe in optical communication with said light source and with 
said monochromator, said probe comprising a plurality of fluorescence 
excitation and collection fibers in spaced relation and a plurality of 
reflectance collection fibers in spaced relation with a reflectance 
illumination fiber; 
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an imaging spectrograph in optical communication with said plurality of 
fluorescence collection fibers and with said plurality of reflectance 
collection fibers; and 

a detector in optical communication with said imaging spectrograph. 

24. The apparatus of claim 23, wherein said plurality of reflectance collection fibers 
and said reflectance illumination fiber are positioned concentrically about said 
plurality of fluorescence excitation and collection fibers. : 

25. The apparatus of claim 23, wherein at least one of said plurality of reflectance 
collection fibers defines a collection position at about 180 degrees relative to said 
reflectance illumination fiber. 

26. The apparatus of claim 23, wherein at least om^: of said plurality of reflectance 
collection fibers defines a collection position at about 90 degrees relative to said 
reflectance illumination fiber. 

27. The apparatus of claim 23, wherein at least one of said plurality of reflectance 
collection fibers defines a collection position at aljput 45 degrees relative to said 
reflectance illumination fiber. 



28. The apparatus of claim 23, wherein said plurality of collection positions are 
spaced between about 0 and about 10 millimetere from said. re;flectance illumination 
■fiber." 

29. The apparatus of claim 23, wherein said probe comprises between twenty-one and 
forty-six optical fibers. 

30. A method for combined fluorescence and spatially resolved reflectance 
spectroscopy of a sample, comprising: 

directing radiation to said sample with a fluorescence excitation fiber; 
collecting radiation from said sample with a fluorescence collection fiber. 
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directing said radiation from said sample to an imaging spectrograph and a 
detector; 

illuminating said sample With a reflectance illumination fiber; 

collecting reflected light from said sample with a reflectance collection fiber in 

spaced relation with said reflectance illumination fiber; and 
directing said reflected light from said sample to an imaging spectrograph and 

a detector. 

31. The method of claim 30. wherein said collecting reflected light comprises 
collecting reflected light from a plurality of collection positions with a plurality of 
reflectance collection fibers. 

32. The method of claim 30. wherein said collecting reflected light comprises 
collecting reflected light from said sample with a reflectance collection fiber defining 
a coUecUon position at about 180 degrees relative to said reflectance illumination 
fiber. 



33. The method of claim 30. wherein said collepting reflected light comprises 
collecting reflected light from said sample with a reflectance collection fiber defining 
a collection position at about 90 degrees relaUve to said reflectance illumination fiber. 

34. The method of claim 30, wherein said collepting reflected light comprises 
collecting reflected light from said sample with a reflectance collection fiber defining 
a collection position at about 45 degrees relative to said reflectance Ulumination fiber. 

35. The method of claim 30. wherein said sample comprises ovarian, head and neck, 
or cervical tissue. 



30 



36. The method of claim 30, further comprising analyzing spectral data from said 
detector to characterize said sample. 
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37. The method of claim 36. wherein said analyzing comprises pre-processing said 
data and reducing a dimension of said data using principal component analysis. 

38. The method of claim 37, wherein said analyzing further comprises selecting one 
5 or more diagnostic principal components of said data and forming one or more 

algorithms. 

39. The method of claim 38, wherein said analyzing further comprises forming one or 
more composite algorithms. 

10 

40. The method ofclaim 38, wherein said analyzing further comprises evaluating at 
least on of said algorithms using a cross-validation technique. 

41. A method for combined fluorescence and. spati.ally resolved reflectance 
15 spectroscopyof a sample, comprising: * ; • 

directing radiation to said sample with a fluorescence excitation fiber; 
collecting radiation from said sample with a fluprescepce collection fiber; 
directing said radiation from said sample to an imaging spectrograph and a 
detector; 

20 illuminating said sample with a reflectance illumination fiber; 

collecting reflected light at a plurality of collection positions from said sample 
with a plurality of reflectance collection fibers aaanged in spaced 
relation; 

directing said reflected light from said sanjple to an imaging spectrograph and 
25 a detector to produce spectral data; , . 

pre-processing said data; and 

reducing a dimension of said data using principal component analysis. 
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42. The metiiod of claim 4 1 , further comprising selecting one or more diagnostic 
principal components of said data and forming one or more algorithms. 
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43. The method of claim 42. ftirther comprising forming one or more composite 
algorithms. 

44. The method of claim 43, ftirther comprising evaluatmg at least one of said 
algorithms using a cross-validation technique. - 

45. A method for analyzing spectroscopy data to defirie-an bptimized reduced data 
set, comprising: 

pre-processing said spectroscopy data; 

reducing a dimension of said spectroscopy data using principal component 
analysis; and 

selecting one or more diagnostic principal components of said spectroscopy 



46. The method of claim 45. wherein said spectroscopy data comprises combined 
fluorescence and spaUally resolved reflectance spectroscopy data. 

47. The method of claim 45, wherein said pre-processing comprises normaUzation of 
said spectroscopy data. 

48. The method of claim 45, wherein said pre-processing comprises mean scaling 
said spectroscopy data. 

49. The method of claim 45, wherein said pre-processing comprises calculating one 
or more derivatives on said spectroscopy data. 

50. The method of claim 45, ftirther comprising eliminating redundant data from said 
spectroscopy data. 

5 1 . The method of claim 45, ftirther comprising forming one or more algorithms and 
evaluating at least one of said algorithms using a cross validaUon technique. 
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52. The method of claim 51, further comprising forming one or more composite 
algorithms. 
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